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PROBES FOR AND METHODS OF 
DIAGNOSIS FOR MUSCULAR DYSTROPHY 
Background of the Invention 
This application is a continuation-in-part of 
5 U.S. Patent Application Serial No. 890,694 which is 
incorporated herein by reference. 

The work reported herein was supported by 
grants from the United States government including 
N.I.H. grants HD 18658 and NS23740. The government has 
10 certain rights in this invention. 

This invention relates to the detection and 
. treatment of hereditary disease, and in particular to 
the detection and diagnosis of Duchenne, Becker and 
Outlier muscular dystrophies (MD) by various methods. 
IS Duchenne muscular dystrophy (DMD) is an 

X-linked recessive genetic disorder which affects about 
1 in 3,300 males. Traits associated with DMD (DMD 
phenotype) are well known and may include elevated 
creatine phosphokinase levels in serum (at least lOx the 
20 normal level), delayed development of motor function, 
and muscle weakness characterized by the replacement of 
muscle fiber with adipose and fibrose tissue accompanied 
by a marked variation in muscle size. Until recently, 
carrier identification in DMD families generally was 
25 accomplished by detecting elevated levels of creatine 
phosphokinase in serum. 

Becker muscular dystrophy (BMD) is also an 
X-linked recessive genetic disorder, but occurs at only 
10% of the frequency of DMD. BMD is a more benign form 
30 of muscular dystrophy which follows a less rapid 
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clinical course than DMD . Both DMD and BMD are caused 
by mutations in the DMD gene located in the Xp21 region 
of the short arm of the human X chromosome. 

Outlier muscular dystrophy (OMD) is a mild 
5 Duchenne/severe Becker disorder. It forms a subgroup 
that comprises approximately 10 percent of individuals 

inflicted with DMD. 

Restriction fragment length polymorphisms 
(RFLPs) occur when the genomic DNA sequence in a 
10 population of animals varies such that complete 

digestion with a restriction endonuclease will result in 
a set of restriction fragments of one or more different 
size arrays. The differences are due to the presence or 
absence in the DNA sequence of sites recognized by the 
15 endonuclease; a sequence in which the site is present 
will be cut by the endonuclease giving one array, and a 
sequence in which the site is not present will fail to 
be cut by the endonuclease, giving another array. 

It has been shown that RFLPs which are located 
20 near a gene associated with a hereditary disease can be 
used to determine the likelihood of a particular person 
having a defective gene and thus having the disease. 
Bakker et al.. The Lancet, March 23, 1985, at 655 
describes "eleven RFLP-markers . . . on the short arm of 
25 the X chromosome [that] are useful in the diagnosis of 
DMD since they bridge the Duchenne locus at genetic 
distances varying between 3 and 20 centiMorgans . " 

Summary of the Invention 
Applicants have cloned 14 kb of human cDNA 
30 subst ant i ally corresponding to the MD gene, and parts of 
the surrounding genomic DNA. Applicants have also 
isolated and characterized cDNA corresponding to the 
homologous locus in the mouse. Dystrophin, the 
polypeptide product of the human MD locus, and its mouse 
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homologue (mMD), has been identified using antibodies 
directed against fusion polypeptides containing two 
distinct regions of MD and mMD cDNA. Parts of the 
complete MD cDNA and surrounding genomic DNA were also 
5 used to define RFLPs in the Xp21 region of the short arm 
of the X chromosome that are useful in diagnosis of MD 
mutations . 

In a first aspect, the invention features an MD 
probe having a substantially purified single-stranded 

10 nucleic acid sequence capable of selectively hybridizing 
to a region of DNA on a human X chromosome between the 
deletion break point at Xp21.3 (defined below) and the 
translocation break point at X;ll (defined below). 
"Selectively hybridizing" means hybridizing to the 

15 desired sequence under conditions that are sufficiently 
stringent to generally prevent hybridization to other 
sequences encountered in the sample. "Substantially 
purified 1 ' or "substantially pure" means a substance that 
is sufficiently separated from other elements of the 

20 environment in which it naturally occurs to function in 
assays according to the invention. 

In preferred embodiments, the nucleic acid 
sequence is incapable of selectively hybridizing to 
other parts of the short arm of the X chromosome, i.e./ 

25 the region of DNA between the deletion break point at 
Xp21.3 and the p terminus of the X chromosome, and to 
the region of DKTA between the translocation break point 
at X;ll and the centromere of the X chromosome. 

In an additional preferred embodiment, the MD 

30 probe has a nucleic acid sequence of at least 10 base 
pairs in length from a portion of the sequence of any 
one of the six cDNA probes, that together span the I4kb 
cDNA corresponding to the human MD gene, deposited on 
July 15, 1987 with the A.T.C.C. under the twelve access 
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numbers 57666 to 57677, inclusive. Each probe is 
deposited in a vector in coli and in the form of a 
purified plasmid. Hence, each probe is available under 
two access numbers. 

5 In a. second aspect, the invention features an 

MD probe having a substantially purified single-stranded 
nucleic acid sequence capable of selectively hybridizing 
to at least a part of the human MD gene. The human MD 
gene, as used herein, is that portion of human DNA on 

10 the short arm of the X chromosome in which a mutation 

can give rise to an MD phenotype. Exons in this DNA are 
substantially represented by the 14 kb cDNA deposit 
discussed below. The sequence for most of this cDNA is 
depicted in Fig. 5. The invention also includes probes 

15 directed to or derived from genomic DNA, including 
intronic sequences in the MD gene. 

In preferred embodiments, the nucleic acid 
probe sequence is incapable of selectively hybridizing 
to regions of DNA outside the MD gene. 

20 The probe of the invention can be used for 

analyzing a sample of genomic DNA of a human subject for 
the presence of a mutant MD gene. 

In a third aspect, the invention features a 
method for diagnosing muscular dystrophy in human 

25 patients involving contacting the probes outlined above 
with a sample and detecting whether these probes 
hybridize to nucleic acid in the sample. 

In preferred embodiments, a series of probes is 
used in the method of diagnosis, in which the series, in 

30 combination, spans a 14 kb sequence of MD DNA or RNA, 
encompassing an MD gene. For example, probes according 
to the invention are used for detecting RFLPs that are 
closely associated (linked) with mutations in the MD 
gene causing an MD phenotype. "Closely associated' 1 
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means that the likelihood of a genetic recombination 
event occurring between the sites of an RFLP and a MD 
mutation is very low (less than 5%). The advantage of 
such probes is that one can use them to predict, with a 
5 high degree of confidence, the probability that an 

individual (within a family with a history of MD) having 
a particular RFLP phenotype will also have an MD 
mutation. 

In a fourth aspect, the invention features a 

10 substantially pure mammalian polypeptide, dystrophin, 
encoded by an MD gene. This polypeptide has both human 
and murine forms. The invention also includes 
substantially pure polypeptides that are immunologically 
cross-reactive with dystrophin, or which have 

15 substantially the same amino acid sequence as a 15 amino 
acid sequence of dystrophin. By "polypeptide" we mean 
to include large polypeptides and proteins. 
"Substantially the same" means that the amino acids in 
such a sequence may be substituted by other amino acids 

20 having equivalent side groups, i.e., conservative 
substitutions are permissible. 

Two polypeptides are immunologically 
cross-reactive if they are bound to the same specific 
antibody or family of specific antibodies. Particularly 

25 useful types of cross-reactive polypeptides are those 
that can be used to challenge an animal to produce an 
antibody that reacts with dystrophin. The invention 
also includes polypeptides that are useful for 
competitive assays, i.e., polypeptides that compete with 

30 dystrophin for binding sites on specific antibodies. 

In a fifth aspect, the invention features 
substantially purified nucleic acid encoding dystrophin, 
or polypeptides immunologically cross-reactive with 
dystrophin. 
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A preferred embodiment of this nucleic acid 
comprises an approximately 14 kb pair sequence of cDNA 
or RNA corresponding to a nucleic acid sequence of a MD 
gene, or fragments thereof greater than 45 bases. This 
5 nucleic acid may correspond to a human MD gene or a 

murine Dmd gene. 

In a sixth aspect, the invention features 
antibodies to dystrophin or polypeptides that are 
immuno logically cross-reactive with dystrophin. 

10 The invention also includes the use of these 

antibodies in a method of immunodiagnosis for MD 
involving contacting a sample of tissue (particularly 
muscle tissue) with one of these antibodies and 
detecting the presence of antibody-dystrophin 

15 complexes. Complexes that indicate the presence of 
dystrophin of normal size and abundance in such tissue 
generally indicate the absence of MD. This aspect of 
the invention includes the numerous well-known formats 
of immunodiagnostics, such as ELISA, and Western blots. 

20 In one embodiment, this method is performed by comparing 
the molecular weight (M.W.) of dystrophin, or fragments 
thereof, that are detected in the complex to the 
molecular weight of a normal control dystrophin 
polypeptide. In this method, the presence of fragments 

25 of dystrophin, i.e., dystrophin of abnormal size, or the 
complete absence of detectable dystrophin, indicates 
MD. A normal control is established by repeated 
analyses of the dystrophin polypeptide found in normal 
individuals without MD or individuals without MD related 

30 dystrophins. 

In general, the normal dystrophin polypeptide 
is approximately 400,000 daltons in size and is absent 
or undetectable in DMD patients. However, BMD patients 
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and a small percentage of patients diagnosed as those 
with DMD exhibit dystrophin of an abnormal size, i.e., 
in the form of smaller M.W. fragments. 

In a preferred embodiment, this method features 

5 the additional step of comparing the molecular weight of 
the sample antigenic polypeptide to the molecular weight 
of BMD and DMD controls. In this method diagnosis of 
DMD is made in the presence of a pattern comparable to 
the DMD control, diagnosis of BMD is made in the 

10 presence of a pattern comparable to the BMD control, and 
the absence of MD is indicated by a pattern comparable 
to the normal control. The BMD and DMD controls are 
established by repeated assays of dystrophin in muscle 
samples in patients diagnosed as having BMD or DMD. The 

15 molecular weight of the dystrophin or fragments thereof 
are determined for each type of MD and are recorded and 
averaged to create a control pattern. A pattern 
"comparable" to these controls is a pattern that is 
identical to, or sufficiently similar to, these controls 

20 to allow a diagnosis of one type of MD, i.e., either BMD 
or DMD. 

The invention also features a method of therapy 

4 

for MD involving the insertion of cDNA, or a fragment 
thereof, corresponding to the MD gene into a vector and 

25 reintroducing these genetically altered cells back into 
the patient. The cells are injected into the 
bloodstream or muscle tissue to produce dystrophin in an 
amount effective to control the degeneration of muscle 
fibers and to control the proliferation of connective 

30 tissue within muscle fibers. 

The methods of the invention can be used for 
prenatal (fetal) screening as well as in the detection 
of heterozygotic individuals carrying MD gene mutations 
but exhibiting no symptoms. 
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Other features and advantages of the invention 
will "be apparent from the following description of the 
preferred embodiments thereof, and from the claims. 

Description of the Preferred Embodiments 
The Figures will first briefly be described. 

Drawings 

Fig. 1 is a standard ISCN (1981) ideogram of 
the p terminal end (pter) of the human X chromosome 
showing the approximate positions o'f cloned DNA 
fragments and specific deletion and translocation break 
points; 

Fig. 2 is a diagrammatic representation of 
220kb of cloned DNA from the human Xp21 chromosome, 
showing regions deleted in some DMD human males; 

Fig. 3 is the DNA sequence of two parts of the 
MD gene used as probes of the invention; 

Fig. 4 is a diagrammatic representation of 
220kb of cloned DNA showing the location and size of 
cloned DNA isolated from librares of human chromosomal 
DNA; 

Fig. 5 shows 13 kb of the 14 kb nucleotide 
sequence corresponding to the human MD gene. 

Fig. 6A is a restriction map of the 14 kb cDNA 
encoding the MD gene; 

Fig. 6B is a diagrammatic representation of 
cDNA fragments that in combination span the entire 14 kb 
cDNA; 

Figs. 7A and 7B are partial nucleotide and 
predicted amino acid sequences of human and murine MD 
cDNAs ; 

Fig. 8 is the amino acid sequence of the 
dystrophin polypeptide. 

Fig. 9 is a photograph of a Southern blot 
analysis of genomic DNA isolated from a DMD family, 
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probed with pERT87-8, and a schematic representation of 
the genotype of each family member, circles represent 
females, squares represent males, darkened areas 
indicate the presence of a MD mutant gene. 

5 Fig. 10 is a schematic of a DNA family 

polymorphism study (for Family A) in which each vertical 
bar represents an Xp21-p22 chromosome region, and each 
letter designates a different polymorphic DNA segment; 

Figs. 11A and 11B are photographs of 

10 immunostained Western blot gels containing muscle 
samples of DMD, BMD and OMD patients. 

Fig. 12 is a photograph of an immunostained 
Western blot containing muscle samples of BMD patients 
and normal non-MD controls . 

15 Fig. 13 is a set of charts depicting the 

distribution of dystrophin biochemical phenotypes. 
Human MD Probes and Nucleic Acid 

Probes of the invention contain nucleic acid 
homologous to DNA of the MD gene or to DNA from a region 

20 of DNA close to the MD gene. A description of the DNA 
region containing segments to which the probes are 
homologous follows. 

Referring to Fig. 1, the approximate positions 
of previously cloned regions of the human X chromosome 

25 (D2, RC8 , 99.6, 754, OTC C7, B24, and L128) are shown 
relative to a standard (ISCN 1981) ideogram of the short 
arm (p) of the human X chromosome. Also shown is the 
region of DNA deletion in the X chromosome of a male 
patient (BB) suffering from three X-linked disorders, 

30 including DMD; the deletion is described by Francke et 
al.. Am. J. Hum. Genet. 37:250 (1985), and the cell line 
is available from Human Genetic Mutant Cell Repository 
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in Gaicideri, N.J. (cell line repository number GM7947A) . 
The deletion is defined by two break points in the DNA, 
Xp21.3 and Xp21.1. Also marked is a break point X;ll 
(p. 21-1; a 13.5), which is attributable to the balanced 

5 reciprocal translocation described by Greenstein et al., 
Cytog. Cell Genet. 27:268 (1980) (cell line available 
from Human Genetic Mutant Cell Repository in Camden, 
N.J., cell line repository identification number GM1695). 

Break points Xp21.3 and X;ll approximately 

10 define the boundaries of the DMD gene, which lies within 
the Xp21.2 chromosome band region. More specifically, 
the MD gene extends approximately lxio bp beyond both 
the BB deletion breakpoint towards the p terminus and 
beyond the X;ll translocation breakpoint toward the 

!5 centromere. 

Following are examples of probes and nucleic 
acids suitable in this invention. These examples are 
not limiting to the invention and those skilled in the 
art will recognize other methods suitable to obtain such 

20 probes. In particular, the probes described below can 
be used to probe other libraries of other animals to 
isolate equivalent DNA or RNA. 

Example 1: pERT87, A Genomic Clone 
In order to isolate the probes of the 

25 invention, a library of DNA from around the MD gene area 
was constructed by enriching for this DNA using 
differential hybridization. The procedure is described 
in detail by Kunkel et al., PNAS 87:4778 (1985). 
Accordingly, 250 pg of DNA from the male patient BB 

30 was isolated, sheared by sonication to a mean size of 
1,Q0Q base pairs (bp), and hybridized to 1.25 \xg of 
Mbo-l cleaved DNA from a 49, XXXX? lymphoid cell line 
(GM1202), as described by Palmer et al.. Cell 37:171 
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(1984). The DNAs were then heated to 100 C for 5 
minutes, cooled on ice, and added to a final reaction 
volume of 2.5 ml of 7% phenol. The mixture was shaken 
intermittently for 37 h over a 5 day period (Kohne et 

5 al., Biochemistry 16:5329 (1977)). After chloroform 
extraction and dialysis to remove salts, the hybridized 
DNA was ethanol precipitated. These conditions allow 
Mbo-1 fragments from the 49, XXXXY DNA, with no 
complementary sequence in the deleted DNA region of the 

10 bb male, to self-hybridize and thus be cloned as Mbo-l 
fragments. 5 \ig of the precipitated DNA was ligated 
to 0.1 \iq BamH l-c leaved dephosphorylated pBR322 and 
transformed into coli strain MC1061. (The BamHl ends 
are compatible in ligation with the Mbo-1 ends of the 

15 chromosomal DNA.) 3,000 colonies were isolated. 

As described by Kunkel et al., supra , the 
isolated colonies were screened using standard 
techniques to determine which one contained DNA that 
hybridized with DNA from normal individuals but not with 

20 DNA from the male BB. One clone, pERT87, hybridized to 
a 1.1 kb Hindi I I fragment of normal human DNA, but not 
to BB DNA. The intensity of hybridization was dependent 
upon the X chromosome content of the DNA being probed. 
Hybridization was also detected in a rodent-human hybrid 

25 cell line having an intact human X chromosome, but not 
with a cell line having DNA only from the human 
chromosome region Xpll.3 to Xqter. Thus, the cloned 
region in pERT87 is on the short arm (p) of the X 
chromosome. Three other clones gave similar results, 

30 although each clone hybridizes to a different Hindi I I 
f r agment . 

The clones were analyzed for their ability to 
hybridize with DNA from a variety of cell lines having 
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deletions or translocations of the human X chromosome. 
pERT87 was localized to DNA within the Xp2l-Xp22 
region. The location of pERT87 was determined by its 
lack of hybridization to the region Xp21-Xter (using 

5 cell line X;ll) and to the region p2l~pter (using a cell 
line with a break point at Xp2l.3, Fryns et al., Clin., 
Genet. 22: 76, (1982)), and by its hybridization to the 
region Xp22-Xqter (using a cell line described by 
Mahandas et al., PNAS 76:5779 (1979)). 

10 As described in Monaco et al., Nature 316 :842 

(1985), pERT87 was used to probe the DNA of 57 unrelated 
DMD males. pERT87 did not hybridize to DNA from five of 
the males indicating that they had deletions of DNA 
around the cloned DNA in pERT87. Assuming that the DMD 

15 phenotype in these five males results from deletions 
within the MD gene, this result provides evidence that 
pERT87 contains DNA homologous to a region of DNA within 
the MD gene. 

Example 2: pERT87 Derivatives 

20 In order to isolate more DNA from the MD 

region, two human genomic libraries were screened with 
pERT87 by the procedure described in Monaco,, supra ; one 
was constructed in Charon 35 (Loenan et al., Gene 26: 171 
(1983)) and one in EMBL-3 (Frischauf et al., J. Molec. 

25 Biol. 170 :827 (1983). Using standard techniques, 

chromosomal "walks 11 were performed in both directions 
from the region homologous to pERT87 using pERT87 as a 
probe for the above libraries. Five bacteriophage 
clones, including Wl and W2-R in Fig. 4, were isolated, 

30 and small unique-sequence subclones, in pBR322, pUCl8 or 
Blue-Scribe (Stratagene) , were constructed. These 
subclones were restriction mapped and small fragments 
free of human repetitive elements identified. Unique 
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sequence subclones near the extreme ends of the human 
inserts in these clones were used as probes for the next 
walking step in the same libraries. Three of these 
subclones, pERT87-18, -8, and -1 contained DNA absent 

5 from the above five DMD boys with genomic deletions. 
These subclones span an area of 38 kb. Further 
chromosome walks in the EMBL-3 library extended the 
cloned area to 220 kb (shown in Figs. 2 and 4). These 
clones include pERT87-l5, -14, and -27. 

10 To determine the positions of these cloned 

regions relative to the MD gene, chromosomal DNA was 
isolated from 53 boys with DMD or the related disease, 
Becker muscular dystrophy (BMD) — the gene for which is 
also located within band Xp21 — who have deleted areas of 

15 DNA in their X chromosome. This DNA was tested, using 
standard Southern blot analysis (see below), against the 
subclones in order to determine the break points of the 
DNA deletions in the 53 boys relative to the cloned 
DNA. This was simply done by first ordering the 

20 subclones as shown in Fig. 4, and then determining which 
subclones contained DNA present in the deleted 
chromosomes (thus being outside the break point) and 
those which contained DNA not present in the deleted 
chromosome (thus having DNA within the break point). 

25 subclones that are positioned between these two types of 
subclones and have DNA present in the deleted 
chromosomes, must have DNA in which the deletion break 
point occurred. Once localized to a region within the 
* cloned DNA, the break points were further localized 

30 within 100-1,000 bp using restriction enzyme analysis. 
The results of these experiments are summarized in Fig. 

2. 

Fig. 2 depicts a schematic drawing of 
220 kb of contiguous human genomic DNA with a kb scale. 
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The darkened blocks represent regions- of repeated 
sequences, cross hatched boxes represent moderately 
repeated sequences, and open boxes represent unique 
sequences. Numbers below these blocks represent pERT87 

5 subclones. Line sequences are represented by an L, as 
described in Singer, Cell 28:433 (1981). As shown in 
Fig. 2, fourteen deletions were found to break at the 
left side of the cloned DNA and extend in a rightward 
direction (see arrows) towards the centromere. Nine 

10 deletions were found that extend in the opposite 
direction. The majority of the deletions have 
independent break points, but for simplicity the 
deletions have been schematically presented together in 
Fig. 2. Since the deletion breaks, which give rise to a 

15 DMD or BMD phenotype, occur within the pERT87 region and 
extend in both directions, this analysis demonstrates 
that pERT87 and the subclones are clearly within the MD 
gene. 

The profile in Fig. 2 indicates that the MD 
20 gene is located on a large segment of DNA. The fact 
that large deletions resulting from mutations in the 
gene yield a DMD phenotype similar to that found in 
other DMD boys who do not have deletions indicates that 
the gene product can be either completely missing or 
2 5 aberrant yet still yield a similar clinical picture. 
The gene responsible lies on a very large segment of X 
chromosome DNA, and is estimated to be approximately 
2,000 kb (Fig. 1). 

Referring to Fig. 3, the DNA sequence of 
30 several subclones are determined by standard procedures, 
using dideoxy analysis. Representative sequences from 
pERT87-4 (upper sequence) and -25 (lower sequence) are 
presented. Probes suitable for the invention can 
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readily be synthesized from any parts of these 
sequences, or any other parts of the 14 kb cDNA sequence 
corresponding to an MD gene described below. Such 
probes can also be used for further chromosome walking. 

Example 3; cDNA 

5 Standard Northern analysis of RNA from a 

variety of human fetal and adult tissues revealed a 
single mRNA estimated to be 16 kb from the total RNA or 
polyA-selected RNA of fetal skeletal muscle which 
hybridizes to pERT87-25. This approximately 16 kb 

10 transcript is the mRNA coding for the product of the MD 
gene. It is possible to isolate this mRNA, using 
standard hybridization techniques, and the probes of the 
invention, and to prepare a cDNA library from it. The 
clones in this cDNA library are suitable as probes of 

15 the invention, and for isolating other clones by 

chromosome walking in human genomic DNA libraries by 
standard techniques towards intronic DNA in the MD gene 
and to the X;ll and Xp21.3 break points. 

For example, by using standard techniques, an 

20 oligo (dT) primed cDNA library is constructed in the 
phage vector lambda gtll using a poly A-selected RNA 
sample from human fetal skeletal muscle. Small 5 '-ended 
clones were isolated from this library. Similarly, a 
human fetal skeletal muscle cDNA library can be 

25 constructed in the phage vector lambda gtio using 

standard techniques as described in Koenig et al . , Cell, 

50:509-517 (1987). 

Such a library was used to isolate six cDNA 
inserts detected with the above 5' -end probes. All six 
30 ended within an estimated 100 bp interval (the five 

longest within a 50 bp interval). This result indicates 
that the 5' end of the transcript is probably present in 
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these clones. Two additional "walks" taken in the 3' - 
direction using the most 3' segment clones from these 
probings also ended within 100 bp of each other, 
indicating that the 3' end of the transcript is probably 

5 present in these clones. As a result, the actual size 
of the mRNA or cDNA corresponding to the human MD gene 
is known to be 14 kb rather than 16 kb estimated in 
earlier studies described above. 

Once the 5' most and 3 r most ends of the human 

10 md gene transcript are determined, the entire 14 kb cDNA 
is sequenced using standard techniques as described in 
Koenig et al., Cell, 50:509-517 (1987). The nearly 
complete sequence is described in Fig. 5. The actual 
cDNA can be obtained from the series of cDNA probes 

15 deposited with the ATCC that in combination span the 
entire human MD gene. These probes are deposited tinder 
the twelve access numbers 57666 to 57677 and are 
depicted, in order, next to the 14 kb MD gene in Figure 
6. 

Z0 These cDNA clones are tested to determine 

whether any clones are particularly suitable as probes 
of the invention. The clones are hybridized, using 
standard methods, to cell lines having the Xp21.3 and 
X;ll mutations and to mRNA from human cell lines. 

25 Suitable probes are homologous to DNA from within the 
Xp21.3 -X;ll region or within the mRNA. Referring to 
Fig. 1, suitable probes have homology to normal human 
cell lines but at least part of their DNA will not have 
homology to DNA from cell line GM7947A and the 

30 nontrans located DNA of the cell line GM1695. A further 
test entails a cytological study to determine whether 
the clone hybridizes to the Xp21.2 chromosomal band, and 
thus would be suitable. 
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Detection of RFLPs 

The probes of the invention are useful in 
detecting RFLPs that are closely linked with a mutated 
MD gene that gives rise to the DMD phenotype. 
Detection of such RFLPs allows diagnosis both of women 
who are carriers of a mutated MD gene and of unborn 
children who have a mutated MD gene. 

Example 5: TTsina pER™ 7 Derivatives 
Using standard techniques, the subclones 
PERT87-8, and pERT87-15 were tested for their ability to 
detect human DNA RFLPs by hybridizing to nitrocellulose 
filters containing immobilized DNA samples from four 
unrelated females that had been cleaved with 24 
different restriction enzymes. The frequencies of the 
15 RFLPs found in the human population were estimated by 
examining DNA obtained from 37 unrelated persons . The 
results of the search are displayed in Table 1, below. 
Seven potential polymorphic restriction enzyme sites 
were identified as individual variations in the pattern 
of restriction fragments observed. Two of the 
subclones, pERT87-l and pERT87-8, each detected putative 
RFLP's of two different enzymes, while one probe, 
PERT87-15, detected RFLPs of three different enzymes. 

Table l 



1. Subclone 

2. pERT87-l 
3. 

4. pERTB7-8 
5. 





Allele 


Allele 




Enzyme 


Sizes (Icb) 


Frequency 




p* q** 


P 


g 


BstNI 


3.1 2.5/0.6 


0.63 


0.37 


XmnI 


8.7 7.5 


0.66 


0.34 


BstXI 


4.4. 2.2 


0.6 


0.4 


TaqI 


2.7/1.1 3.8 


0.71 


0.29 
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10 



15 



20 



25 



30 



1. Subclone 



Enzyme 
BamHI 
TaqI 
XmnI 



Allele 
Sizes (kb) 

7,1/2.3 9.4 



Allele 
Frecruency 

0.62 0.38 



3.1 
1.6/1.2 



3.3 
2.8 



0.67 
0.68 



0.33 
0.32 



6. pERT87-15 
7. 
8. 

* common allele 
** rare allele 

The next step was to screen the RFLPs clones 
for linkage to the DMD phenotype. An RFLP is linked to 
the MD gene within a particular pedigree (family) if 
within the pedigree, persons whose DNA contains a mutant 
MD gene giving rise to the DMD phenotype have a 
polymorphic distribution different from those persons 
within the pedigree whose DNA does not contain a mutant 
MD gene. The RFLP screening was performed on pedigrees 
known to harbor a mutant MD gene. 

The first step was to obtain DNA from a 
subject. In the method described below, peripheral 
blood lymphocytes are used; other possible sources of 
DNA are amniotic fluid, chorionic villus, and fetal 
trophoblasts. 

Using standard techniques, genomic DNA is 
isolated from cell nuclei of whole blood leukocytes as 
described by Aldridge et al., Am. J. Hum. Gen., 36: 546 
(1984). This genomic DNA is then analyzed to determine 
whether linkages exist between the RFLPs of Table l and 
defective MD genes of certain families. 

An example of such linkage is presented in 
Figure 9. In the example, DNA was isolated from 
individuals within a family in which some people had a 
mutated MD gene (indicated by darkened symbols) the DNA 
was cleaved with BstX I , and probed with pERT87-8. 
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Electrophoresis, blotting, and hybridization were as 
described above. On BstXI digestion of genomic DNA, 
pERT87-8 hybridizes to 4.4 and 2.2 kb BstXI fragments in 
females heterozygous for the RFLP. The carrier mother 

5 exhibits the heterozygous pattern of the 4.4/2.2 kb 
BstXI alleles, as do her two carrier daughters (lanes 
2,4). The lower allele (2.2 kb) is present in both 
affected DMD sons (lanes 5,6) while only the upper 
allele (4.4 kb) is present in the unaffected son 

10 (lane 3). The father of this family has a 4.4 kb BstXI 
hybridizing fragment (data not shown). The female 
individuals in lanes 2 and 4 were suspected of being 
carriers of the DMD trait based on elevated CPK values . 
Fragment sizes were calculated by comparison with 

15 32 p-end-labelled Hindlll-cleaved X phage DNA. 

Thus, the mutated MD gene is linked to the BstXI RFLP in 
this family, and individuals with 2.2 kb fragment also 

have a mutated MD gene. 

Linkage between each RFLP listed in Table l and 

20 a defective MD gene does not occur in every pedigree. 
Accordingly, a combination of probes, detecting a 
plurality of RFLPs , is preferably used when examining 
DNA of an individual. Examining a combination of RFLPs 
listed in lines 2, 3, 5, 6, and 7 of Table l with the 

25 appropriate probes was found to be informative in 25 of 
28 (89%) pedigrees in which members were known to have a 
mutated MD gene. It is also possible to use probes of 
the invention for presymptomatic screening of humans 
without a DMD phenotype for deletions (of greater than 

30 ioo bps) in the MD gene. 

Example 6; Using cDNA Probes 
Although RFLP DNA marker studies are usually 
very useful, they are often impossible or inconclusive 
because family members may be unavailable, new mutation 
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events cannot be detected reliably, meiotic crossovers 
within the gene occur frequently and the inheritance 
pattern (X-linked versus autosomal) cannot be determined 
with certainty in sporadic cases. 

5 Detection of molecular deletions with the 

complete gene cDNA probes of the invention eliminates 
most of these difficulties. For example, the probes of 
the invention deposited with the ATCC under the access 
numbers 57666 to 57677, inclusive, are very useful 

10 diagnostic tools as described below. 

NUclear DNA of patients was isolated from blood 
leucocytes, lymphoblastoid cell lines and chorionic 
villi, cleaved with restriction endonuc leases, 
transferred to Hybond (Amersham) filters, and hybridized 

!5 with oligo-labelled probes, by standard techniques. 
cDNA segments covering the complete MD gene in a 5' to 
3' direction, cloned in M13 vectors Bluescribe or 
Bluescript (Stratagene) , are used as probes. For 
example. Figure 6 shows the probes that have been 

20 deposited with the A.T.C.C. Probes l-2a and 2b-3 

comprise the most 5' 2.6 kb Bql ll- EcoR I fragment. Probe 
4-5a is a 1.8 kb fragment and probe 5b- 7 has a 2.6 kb 
insert. Probe 8 is a 0.9 kb EcoR I fragment. Probe 9-14 
is a 6.1 kb fragment that includes the 3' exons and the 

25 3 r untranslated region. 

Deletions in the MD gene do not occur in a 
random distribution along the cDNA, but a large 
percentage of the deletions can be detected with only a 
few probes. In a total of 104 DNA samples of unrelated 

30 DMD boys 53 exhibited a deletion of part of the MD 
gene. For diagnostic purposes, 27 deletions are 
detected with probe 8, and 15 others with probe l-2a. 
These two probes detect nearly 80% of the deletions 
observed (42 of 53). 
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Particularly intriguing is the large number of 
deletion breakpoints found to originate within a single 
intron defined by probe 5b-7. One intron defined by the 
exons contained in this probe exhibits 19 deletion 

5 breakpoints (36% of the 53 total deletions found) . 

Clearly this particular intron contains a characteristic 
that imparts a propensity for initiating deletion 
events. The multiple breaks could be due to a 
sequence-specific rearrangement hot spot, or an 

10 extraordinarily large genomic distance spanned by this 
intron. 

The following cost-efficient strategy for the 
DNA-di agnostic work-up of DMD/BMD families is based on 
these results. Initially, DNA from affected 

15 individuals, their mothers, obligate carriers and normal 
male and female controls is cleaved with Bglll and 
Hin di II, electrophoresed and blotted onto nylon 
membranes that can be easily rehybridized. These 
filters are first hybridized with probe 8, exposed to 

20 X-ray film, and then rehybridized with probes l-2a and 
2b-3 in a sequential fashion. If no deletion is 
detected, probes 5b-7 and 4-5a are used next, and probe 
9-14 is used in the final round. It is advisable to use 
two different restriction enzymes because fragments of 

25 similar size may co-migrate and appear as a single band 
on the autoradiogram. If only one of the co-migrating 
fragments were deleted, such a deletion could be 
missed. By this approach deletions should be detectable 
in 50% of affected males, and by careful dosage 

30 comparisons in 50% of female carriers. This direct 
method yields unambigous diagnoses of hemi- and 
heterozygosity eliminating the need for indirect linkage 
testing in at least half of all families. The routine 
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analysis of families at risk for MD using the complete 
MD cDNA could dramatically decrease the cost and labor 
involved and greatly increase the accuracy of other 
diagnostic methods. 

5 If no deletions are found, DNA polymorphism 

studies as described above are undertaken. These need 
to include closely linked flanking probes or segments 
near the 5' and the 3 1 ends of the gene as well as 
intragenic markers as argued above. 

10 Following is an example of the use of cDNA 

probes. Family A (Fig. 10) represents a common clinical 
situation. That is, having only one male affected with 
DMD (III-l). The carrier status of his mother (II-2) 
was unknown, since her creatine kinase levels were in 

15 the normal range , as were her mother's (1-2). Her male 
fetus (III-2) was at risk. A DNA polymorphism study was 
carried out to determine whether the present fetus 
(III-2) had inherited the same X-21 region as had his 
affected brother. The individuals shown in Fig. 10 were 

20 tested for 21 DNA polymorphisms detected by 18 different 
probes, some of which are flanking the MD locus and some 
were subsequently shown to be derived from the gene 
itself. No deletions were seen with these probes. 
Haplotype analysis revealed that the affected male 

25 (III-l) had received his Xp21 chromosome region from his 
grandfather (1-1), transmitted without apparent 
crossover. Therefore, his grandmother (1-2) should not 
be a carrier, but the carrier status of his mother 
(I 1-2) remained unknown. Either she or her affected son 
30 (III-l) could represent a new mutation. Based on the 
DNA-marker results, the risk of the fetus (III-2) being 
affected was considered higher than the original 20% 
estimate. To prevent the abortion of a potentially 
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unaffected fetus the usefulness of the complete gene 
cDNA probes of the invention was evaluated for detection 
of the molecular defect in the affected male. 

For cDNA studies, the entire gene was surveyed 

5 for the presence of missing or abnormal-sized 

restriction fragments in the affected male (III-l), his 
mother (II-2) and controls including his unaffected 
maternal grandfather (I-l) from whom the Xp21 region was 
derived. Genomic DNA samples were cleaved with Hindi I I 

10 and Bglll and hybridized with the cDNA probes 

illustrated in Fig. 6 and analyzed by known procedures. 
DNA extracted directly from CVS material (III-2) was 
digested with Hindi 1 1 and Ball I and analysed together 
with DNAs from family members (I-l, II-2, III-l), 

15 unrelated male controls (C, Ml - M5) and a normal female 
control (F) . 

The mother (II-2) is apparently homozygous for 
all the fragments deleted in her son (III-l). The 
deletion detected with two adjacent segments of the cDNA 

20 likely represents the molecular basis of the DMD 

phenotype in the affected male. Therefore, the fetus 
"at risk" was determined to be unaffected with a 99% or 
higher probability. 
Mouse MP Probes and Nucleic Acid 

25 The above human cDNA was used as a probe in 

Northern blot analysis of mouse tissue mRNA. The human 
cDNA detected a 14 kb mouse mRNA species, which is 
present in very low levels in tissue extracts of mouse 
newborn leg and gravid (15 day) combined uterus and 

30 placenta and in higher levels in newborn heart, adult 
heart, and adult skeletal muscle. 

In these studies, RNA was isolated from mouse 
tissues by homogenization of frozen, ground tissues in 
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guanidium thiocyanate followed by pelleting through a 
CsCl cushion as described in Chirgwin, J.M. et al., 
Biochemistry 18:5294 (1979). 

In order to obtain cDNAs representative of the 

5 mouse Mb gene, a cDNA library was constructed from adult 
mouse heart as described in Gubler, U. et al.. Gene 
25:263 (1983) as modified in Hoffman, E.P. et al.. 
Science 238:347-350, n.12 (1987), and screened with 
probes of cDNA corresponding to the human MD gene. 

10 Seven mouse cDNA clones were obtained from 5 X 10 5 
primary (unamplif ied) \ gtlO recombinants, using 
standard techniques. 

The low frequency of MD clones obtained in both 
human and mouse cDNA libraries suggests that the MD 

15 messenger RNA (mRNA) is rare, probably representing 
about 0.01 to 0.002% of the mRNA in muscle. This 
relative abundance is in agreement with Northern blot 
analyses and the frequency with which recombinant DMD 
clones were obtained from four other mouse and human 

20 cDNA libraries constructed in our laboratory. 

Fragments of the mouse cDNA were used as probes 
for Southern blots of mouse genomic DNA using standard 
procedures as described in Hoffman et al.. Science 
238 :347-350 (1987). In view of the 14 kb size of the 

25 complete mRNA, and assuming a constant ratio of cDNA to 
genomic DNA, the complete mouse MD genomic locus 
probably encompasses more than 500 kb of genomic DNA. 
Thus, the hybridization characteristics of the X-linked 
human MD cDNA described above are also conserved in the 

30 mouse. 

The DNA sequences of the first about 4.3 kb 
mouse MD cDNA was determined (Figs. 7A and 7B) and 
compared to human cDNA sequence. The total nucleotide 
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sequence of 4.3 kb represents nearly 30% of the entire 

DMD mRNA. 

Interspecies comparison of the sequences from 
the 5' end of the mouse and human cDNAs (Fig. 7a) show 
5 that the first 200 bp are 80% homologous with multiple 
insertion/deletion differences and translation stop 
codons in three reading frames. Furthermore, the first 
translation initiation codon is conserved in both humans 
and mice. 

10 Both the DNA and amino acid sequences are well 

conserved, exhibiting 88% homology with the DNA and 87% 
with the amino acid sequence. There is a particularly 
striking conservation in the hydropathicity profile of 
the mouse and human amino acid sequences, the 

15 hydropathicity profiles being nearly identical. These 
profiles were determined according to Kyte et al., J. 
Mol. Biol., 157:105 (1982). Indeed, by conservative 
amino acid substitutions based on hydropathicity values, 
the mouse and human polypeptides become more than 95% 

20 homologous. 

Recombinant Dystrophin and F ragments Thereof 

The cDNA and predicted amino acid sequence for 
the 14 kb human or mouse MD gene can be used to produce 
recombinant dystrophin, which is named after the 
25 muscular dystrophy gene to which it corresponds. The 
amino acid sequence of dystrophin is shown in Fig. 8. 

For example, MD cDNA may be inserted into an 
appropriate vector, introduced into a cell and 
transcription and translation of the DNA induced, to 
produce dystrophin. Similarly, smaller fragments may be 
produced by using only a part of the total cDNA. 
Myoblast cells are a suitable expression system. They 
are cultured and myotube formation induced by standard 
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techniques, as described by Yasin et al., J. Neurol. 
Sci., 32:347-360 (1977). Other systems include 
bacteria, such as E. coli, Yeast, and cultured mammalian 
cells. In such systems, the dystrophin may be secreted 

5 or not and may be linked to any of several standard 
promoters or terminators. Dystrophin may also be 
produced by in vitro translation of mRNA encoding 
dystrophin. It may also be produced synthetically. 

The above examples are not intended to limit 

1° the present invention but are merely suggestions as to 
how dystrophin and fragments thereof can be produced. 

The invention encompasses cDNAs, mRNAs, and 
polypeptide sequences that are modified but 
substantially correspond to the cDNA and polypeptide 

IS sequences defined herein. A nucleic acid or polypeptide 
sequence substantially corresponds to a given sequence 
when it is identical except for conservative nucleotide 
or amino acid substi tut ions. 

Example 4: Dystrophin Fusion Polypeptides 

20 The DNA and predicted amino acid sequences for 

4.3 kb (30%) of the mMD gene described above were used 
to produce fusion polypeptides and substantially 
purified mouse dystrophin . 

Two different regions of the mouse heart mMD 

25 cDNA were fused to the 3 1 terminus of the E^ coli trpE 
gene using the expression vector pATH2 (Dieckmann et 
al., J. Biol. Chem 260:1513-1520 (1985)). The two 
regions represent the majority of the mouse heart cDNA 
sequence described above and shown in Fig. 7B. One 

30 construction resulted in the fusion of approximately 30 
kD of mMD dystrophin to the 33 kD trpE polypeptide, 
while the second fused roughly 60 kD of the mMD 
polypeptide. Since the trpE polypeptide is insoluble, 
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quantitative yields of induced fusion polypeptides were 
obtained by lysing the cells and precipitating insoluble 
polypeptides. Novel insoluble fusion polypeptides of 
the expected size were produced which were not present 
5 in lysates of bacteria containing the pATH2 vector alone. 

To produce the trpE + 60 kD fusion, the mouse 
MD cDNA was restricted at the unique Spel site, 
blunt-ended with Klenow, and then digested with Hindlll 
in the 3' polylinker. The excised cDNA fragment of 1.4 
10 kb was gel purified and ligated to pATH2 which had been 
digested with Sma l and Hindlll. Recombinants were 
identified by colony hybridization and verified by 
subsequent plasmid DNA restriction analysis. The 
resulting plasmid construction fused the trpE 
15 polypeptide (33 kd) to 410 amino acids (-60 kD> of the 
mMD polypeptide, and corresponds to position 1.3 kb to 
2.7 kb on the equivalent human cDNA map. 

To produce the trpE + 30 kD fusion, the most 3 ' 
end of the mouse cDNA was restricted at its unique 
20 non-methylated Xbal site, and at the BamHI site in the 
3' polylinker. The excised 700 bp fragment was ligated 
to pATH2 and digested with Xbal and BamHI as described 
above. This plasmid construction fused the trpE 
polypeptide to 208 amino acids (~30 kD) of the mDMD 
25 polypeptide, and corresponds to position 3.7 kb to 4.4 
kb on the equivalent human cDNA map. 
Antibodies to Dystrophin or Fragm ents Thereof 

Antibodies can be raised by standard techniques 
to any of the above described recombinant dystrophin, or 
30 fragments thereof. Further naturally ocurring 
dystrophin may also be used. The following is an 
example of such a method and is not limiting to this 
invention. 



WO 89/06286 

> 



PCT/US88/04504 



- 28 - 



Both of the fusion polypeptides described above 
were purified by preparative SDS-polyacrylamide gel 
electrophoresis (Laemmli, Nature 227:680-685 (1970)), 
and used to immunize rabbits and sheep. Rabbits were 
5 immunized with electroeluted, 'native* (free from SDS) 
insoluble antigen, while sheep were immunized with 
SDS-polyacrylamide gel slices containing denatured 
antigen. 

' The titers and specificity of the antibodies 
10 produced in each rabbit were constantly monitored by 
enzyme-linked imrauno-assays performed on nitrocellulose 
dot blots of insoluble polypeptide fractions. The best 
immune responses were obtained using the trpE + 30 kD 
polypeptide with >95% of the antibodies produced being 
15 specifically against the mDMD portion of the fusion 

peptide, and with titers greater than the sensitivity of 
the ELISA assay system when using a 1:1000 dilution of 
crude serum 4 weeks after immunization. The trpE + 60 
kD antigen took much longer (12 weeks) to evoke a immune 
20 response in rabbits, with the resulting sera showing a 
low specificity for the MD portion of the fusion 

polypeptide. 

To ensure that any polypeptide species 
identified by the antisera was due to recognition by 

25 antibodies specific for the MD portion of the fusion 
polypeptides, rabbit and sheep antibodies directed 
against the 30 kD antigen, and sheep antibodies directed 
against the 60 kD antigen were affinity purified. 
Affinity purification (AP) of the antibodies directed 

30 against the MD portion of the fusion polypeptide is 
facilitated by the insolubility of the partially 
purified fusion polypeptide. In general, by simply 
resuspending crude insoluble trpE polypeptide fractions 
in immune serum, antibodies against the trpE polypeptide 
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are eliminated. Antibodies specific for the MD 
polypeptide are then isolated by binding with fusion 
polypeptides. These antibodies allow detection of as 
little as 1 ng of dystrophin in 50 \iq muscle tissue. 

5 For example, approximately 3 mg of a partially 

purified insoluble fraction of trpE polypeptide is 
precipitated, resuspended in 10 mM Tris (pH 8.0), and 
then precipitated again. The pellet is resuspended in 
1.5 ml of immune serum, incubated on ice for 1 hour, 

10 then centrifuged to pellet the trpE-antibody immune 

complexes, which are discarded. The supernatant is then 
mixed with approximately 3 mg of partially purified 
fusion polypeptide (insoluble fraction) which has been 
washed as above. After incubation on ice, the 

15 mDMD-antibody immune complexes are precipitated by 
centrifugation. The pellet is then resuspended in 500 
ul of 0.2 M glycine (pH 2.3), incubated on ice for 5 
minutes to disassociate the immune complexes, and then 
centrifuged at 4° C to precipitate the insoluble 

20 antigen. The supernatant contains purified anti-MD 
immunoglobulins and is neutralized with 50 ul Tris (pH 
9.5), and either stabilized with BSA (fraction V) 5 
mg/ml) or dialyzed extensively against phosphate 

buffered saline (PBS). 

25 The invention also contemplates the production 

of monoclonal antibodies to dystrophin or fragments 
thereof. Such monoclonal antibodies are secreted by 
hybridomas produced by standard techniques. 
Natural Dystrophin 

30 Naturally occur ing dystrophin can be identified 

using the above-described antibodies. For example, 
total polypeptide samples were isolated from mouse 
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(fresh) and human (frozen) tissues by direct 
solubilization of tissues in ten volumes of gel loading 
buffer (100 mM Tris pK 8.0, 10% SDS, 19 mM EDTA, 50 mM 
DTT) . Alternatively, Triton X-100 insoluble fractions 

5 are isolated from human and mouse tissues by 

homogenizaton in 0.25% Triton X-100 using a Waring 
blender at full speed, and pelleting insoluble 
polypeptides. All polypeptide samples (50 ug) are 
separated by electrophoresis on 3.5% to 12.5% gradient 

10 SDS-polyacryl amide gels (Laemmli, Nature 227:680-685 
(1970)) using a 3.0% stacking gel, and transferred to 
nitrocellulose (Towbin, Proc. Nat. Acad. Sci. USA 
76:4350-4354 (1979)). Identical nitrocellulose blots of 
the separated polypeptides are incubated with affinity 

15 purified rabbit antibodies directed against the 30 kD 
antigen, affinity purified sheep antibodies directed 
against the 60 kD antigen, and affinity purified sheep 
antibodies directed against the 30 kD antigen each at a 
1:1000 dilution. Immune complexes are detected using 

20 either 125 I-polypeptide A, or alkaline phosphatase 

conjugated donkey anti-sheep IgG second antibody (sheep 
IgG binds very poorly to polypeptide A) . All antibodies 
detect a large molecular weight, apparently low 
abundance polypeptide species calculated to be 

25 approximately 400 kD in total. The higher resolution of 
the alkaline phosphatase staining resolves this 
polypeptide into doublets or triplets, though the 
slightly smaller bands most likely represent degradation 
products since there has been no evidence to date for 
30 alternatively spliced isoforms of the MD mFNA. The 40.P 
kD species is clearly evident in mouse smooth muscle . 
(stomach), though at a level that is substantially less 
than that found in cardiac and skeletal muscle. The 
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same apparent polypeptide species is detectable in mouse 
brain at an extremely low level. 

The 400 kD polypeptide species recognized by 
the antibodies of the invention is generally Triton- 

5 insoluble, though it appears to be associated more 
strongly with the myofibrillar matrix fraction in 
cardiac muscle than in either skeletal muscle or smooth 
muscle. The 400 kD polypeptide species is present in 
the skeletal and cardiac muscle of both normal and Tr 

10 mice. The detected polypeptide appears the same with a 
mixture of antisera as it does with each antiserum 
seperately, indicating that the antibodies raised 
against different antigens recognized the same 
polypeptide. Both antibodies fail to detect the 400 kd 

15 polypeptide in muscle tissues isolated from mice 
harboring either allele of the mdx mutation. 

Immunodiaqnosis 

The antibodies described above can be used in a 

variety of immunological applications to test for the 
20 presence of dystrophin or fragments thereof in 

biological samples. For example, the antibodies can be 
labelled by conventional procedures with I, S, 
or 3 H for use- in radioimunoassays, with fluorescein 
for fluorescent assays, with enzyme for enzyme 
25 immunoassays, or with biotin for biotin-avidin linked 
assays . 

These antibodies can be used labelled or 
unlabelled as desired and can be employed in competitive 
immunoassays as well as in double antibody or "sandwich" 
30 assays. Monoclonal antibodies can also be used in 
standard techniques to test polypeptides for 
cross-reactivity with dystrophin. 
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The antibodies can also be immobilized on an 
insoluble phase, such as an insoluble resin. The 
dystrophin level is then detected by measuring the 
amount of binding to the insoluble phase. Other 
5 insoluble phases include latex particles, which will 
agglutinate when coated with the novel antibodies and 
subjected to certain levels of dystrophin in the 
sample. Yet other insoluble phases include test tubes, 
vials, titration wells, and the like to which the 
10 antibodies according to the invention can be bound and 
the dystrophin "antigen" thereto detected by double 
antibody techniques or Protein A dependent techniques. 
These antibodies are also useful for purification of 
dystrophin by affinity purification as described above. 
15 m particular, the antibodies according to the 

invention can be used for the immunodiagnosis of MD. 
• Futhermore, these novel antibodies can be used to 
distinguish between Becker's MD (BHD), Duchenne's MD 
(DMD), Outlier's MD (OMD) and other neuromuscular 
20 disorders. For example, muscle biopsies of patients can 
be prepared and tested according to the invention to 
determine with a high degree of accuracy whether the 
patient has DMD, BMD, OMD, or no MD . 

To perform such a test, small portions of the 
25 biopsies (less than 0.1 g) are crushed into a very thin 
sheet of tissue using a pestle and a plastic weigh boat 
on a hard surface at -20 °C. The tissue fragments are 
then weighed at 4°C, and immediately placed in 20 
volumes of loading buffer (10% SDS, 50 mK DTT, 10 mM 
30 EDTA, 0.1 M Tris pH 8.0, 0.001% bromophenol blue) and 
vigorously shaken, samples are stored at 4°C prior to 
use . 

Samples are placed in a boiling water bath for 
two minutes, then centrifuged to remove SDS-insoluble 
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polypeptides. Supernatants (1-2 ul; 50 ug) are loaded 
on 20 well, 0.8 mm x 15 cm x 17 cm SDS-polyacryl amide 
gels (Laemmli, Nature, 227: 680-85 (1970)), using a 3% 
(0.1% bisacryl amide) stacking gel and 3.5%-12.5% 

5 . gradient resolving gel. Fractionated polypeptides are 
transferred to nitrocellulose (Towbin, H. et al., PNAS, 
76: 4350-54 (1979)), then incubated with the affinity 
purified sheep anti-mouse dystrophin described above. 
Dystrophin/ anti-dystrophin immune complexes are 

10 detected using affinity purified donkey anti-sheep IgG 
second antibody conjugated to alkaline phosphatase 
(Sigma), followed by colorimetric development. All gels 
are stained with Coomassie Blue after transfer to 
determine the efficiency of the transfer. 

15 Residual myosin in the post-transfer Coomassie 

blue-stained gels serves as a control for the amount of 
muscle tissue contained in the biopsy. The accuracy of 
this residual myosin quantitation of muscle polypeptide 
is verified by alpha-actin immunostaining after 

20 dystrophin staining is completed, or by calsequestrin 

immunostaining . 

Example 7: Muscle Immunodia gnosis 

Samples from 103 muscle biopsies were 

analyzed. All samples were scored for the abundance and 

25 apparent size of the polypeptide product of the KD gene, 
dystrophin, described above. 

The abundance and apparent molecular weight of 
dystrophin contained in 103 muscle biopsies from various 
neuromuscular disease patients was determined relative 

30 to a normal control (examples are shown in Figure 11). 
Biopsies obtained from patients previously diagnosed as 
Duchenne exhibited dramatically reduced levels of 
dystrophin (Fig. 11a, lanes 1, 2, 9-16; Fig. lib, lanes 
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3, 8, 9, 12, 14, 16) relative to the normal control 
(Fig. lla, lane 17; Fig. lib, lanes 1, 5, 17). On the 
other hand, most patients previously diagnosed as being 
afflicted with a neuromuscular disorder unrelated to 
5 Duchenne or Becker exhibited dystrophin levels 

indistinguishable from the normal control (Fig. lib, 
lanes 6, 7). The Becker biopsy samples exhibited a more 
variable dystrophin phenotype (Fig. lla, lanes 3, 5, 6; 
Fig. lib, lanes 2, 4, 10, 11). Some Becker biopsies 
!0 contained dystrophin of apparently smaller molecular 
weight but of nearly normal abundance (Fig. lla, lanes 
5, 6). One Becker shown in Figure lib, lane 2, appeared 
to have dystrophin which was larger in size, but reduced 
in abundance to approximately 40% of normal. Another 
15 Becker exhibited a dystrophin phenotype which was 

undistinguishable from normal (Fig. lib, lane 4), while 
an additional Becker contained no detectable' dystrophin 
in his biopsy (Fig. lla, lane 3). Three 'Outlier' 
biopsies (mild Duchenne/severe Becker) are also shown, 
20 two of which exhibit dystrophin of low abundance (Fig. 
lla, lane 11; Fig. lib, lane 15), while the third 
contains no detectable dystrophin (Fig. lib, lane 13). 

The size variations of dystrophin were often 
subtle, such that it was difficult to unambiguously 
25 determine size differences if the sample in question was 
not run immediately adjacent to normal controls (see two 
Becker biopsies. Fig. lib, lanes 10, 11). For this 
reason, all samples containing detectable dystrophin 
determined by the above analysis were retested adjacent 
30 to normal controls. An example of this analysis is 
shown in Figure 12. The three Becker biopsies shown 
contain dystrophin which was either clearly smaller 
(lanes 2, 6) or larger (lane 4) than that of the normal 
controls (lanes l, 3, 5). 
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The described experimental process was reported 
for each of the 103 biopsies included in the study. The 
dystrophin molecular weight (normal vs. abnormal size), 
and abundance relative to the normal control sample was 

5 thus calculated for each patient's biopsy. Five 

dystrophin 'phenotypes' were established which accounted 
for all patient dystrophin data, and which seemed to 
correlate to disease type and severity. These 
dystrophin phenotypes ranged from 'normal' (normal size, 

10 60-100% of normal abundance), to 'severe' (less than 3% 
of normal abundance) . The number and percentage of 
patients under each of four clinical categories 
(Duchenne, Outliers, Becker, non-DMD/BMD) was then 
tabulated with regards to the five dystrophin phenotypic 

15 classes (Table 2) . 

Ninety-five percent (38/40) of the patients 
previously diagnosed as having disorders unrelated to 
Duchenne or Becker muscular dystrophy exhibited 
completely normal dystrophin phenotypes. On the other 

20 hand, 92% (58/63) of the patients previously diagnosed 
as having a Duchenne/Becker dystrophy exhibited a 
clearly abnormal dystrophin phenotype. Within the 
Duchenne/Becker individuals exhibiting an abnormal • 
dystrophin phenotype there was a clear correlation • 

25 between the severity of the clinical phenotype and the 
'severity' of the dystrophin phenotype (Fig. 13, roman 
numerals correspond to categories in Table 2) . From 
this analysis it becomes apparent that the majority of 
Duchenne patients exhibit undetectable levels of 

30 dystrophin, the Outliers (mild Duchenne/severe Beckers.; 
included two affected females) exhibit low levels of . 
dystrophin of normal size, while most Beckers exhibit 
nearly normal levels of dystrophin of abnormal size. 
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There are individual patients whose clinical « 
phenotype clearly does not correlate with the dystrophin 
phenotype (Table 2, asterisks). These patients 
comprised 7.7% of the total patients studied (8/103). 
5 Table 2 

Results of dystrophin analysis in 103 patient biopsies. 

Clinical Diagnoses 

10 Duchenne Outliers Becker Non-DMD/BMD 

Dystrophin Data 

I. Normal size, 1(2.6%)* 4 (22.2%)* 38 (95.0%) 
15 60-100% level 

II. Abnormal size, 1(14.3%) 12 (66.6%) 2 (5.0%)* 
40-100% level 

20 III. Normal size, 1 (2.6%) 4 (57.1%) 1 (5.5%) 
3-60% level 

IV. Abnormal size, 1 (2.6%) 1 (14.3%) 
3-40% level 

25 

V. <3% level or 35 (92.1%) 1 (14.3%) 1 (5.5%)* 
undetectable 



30 Totals 38 7 18 40 



' 1 >**.'* 



WoiifflS---' PCT/US88/04504 



- 37 - 



Therapy 

The dystrophin or biologically functional 
fragments thereof can also be used in a method of 
therapy for MD. A biologically functional polypeptide 

S fragment is capable of causing the same biological 

effects as the normal full polypeptide. The method of 
therapy involves administering an amount of the 
dystrophin or biologically functional fragments thereof 
effective to control muscle fiber degeneration and 

10 connective tissue proliferation within muscle fibers. 

The dystrophin and related polypeptides as 
described above can be administered by routine methods. 
For example, they can be injected directly into the 
blood stream or muscle of an animal or human patient to 

15 a final concentration of between l and 500 mg/ml of 

serum, most preferably 200 mg/ml. This dose is repeated 

to maintain this level. 

A further method of therapy involves removing 

cells, such as muscle cells, from an MD patient and 
20 incorporating MD DNA into them. The 14 kb cDNA of the 

invention, or a fragment thereof, is. linked to a 

suitable promoter and inserted into a suitable vector. 

The vector is then introduced into the cells harvested 

from the patient and the genetically manipulated cells 
25 induced to produce the dystrophin polypeptide, or 

biologically functional fragments thereof. These cells 

are then reintroduced into the patient, for example, 

intravascular ly or intramuscularly. 

Deposit 

30 The complete 14kb cDNA sequence depicted in 

Figure 5 has been deposited in the form of 6 cDNA probes 
depicted in Figure 6a. These 6 probes have been 
deposited with the ATCC under the access numbers 57666 
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to 57677, inclusive. The deposit was made on July 15, 
1987. Each probe is deposited as a purified plasmid and 
as a plasmid in coli . Hence, each probe has two 
access numbers. 

5 Applicants' assignee. Children's Medical Center 

Corporation, acknowledges its responsibility to replace 
these cultures should they die before the end of the 
term of a patent issued hereon, 5 years after the last 
request for a culture, or 30 years, whichever is the 

10 longer, and its responsibility to notify the depository 
of the issuance of such a patent, at which time the 
deposits will be made irrevocably available to the 
public. Dtitil that time the deposits will be made 
available to the Commissioner of Patents under the terms 

15 of 37 CFR Section 1.14 and 35 Section 112. 

Other embodiments are within the following 

claims. 
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Claims 

1. An MD probe comprising a substantially 
purified single-stranded nucleic acid sequence capable 
of selectively hybridizing to a region of DNA on a human 

5 x chromosome between the deletion break point at Xp21.3 
and the translocation break point at X;ll. 

2. The MD probe of claim 1 comprising a 
nucleic acid sequence of at least 10 bases in length 
from a portion of the sequence of a cDNA probe selected 

!0 from the group consisting of cDNA probes deposited with 
the A.T.C.C. under the twelve access numbers 57666 to 

57677, inclusive. 

3. The MD probe of claim 1 wherein said 
nucleic acid sequence is incapable of selectively 

15 hybridizing to the region of human DNA between said 

deletion breakpoint at Xp21.3 and the p terminus of said 
X chromosome, and to the region of DNA between said 
translocation point at X;ll and the centromere of said X 
chromosome . 

20 4. An MD probe comprising a substantially 

purified single-stranded nucleic acid sequence capable 
of selectively hybridizing to at least a part of the MD 
gene . 

5. The MD probe of claim 4 wherein said 
25 nucleic acid sequence is incapable of selectively 

hybridizing to the regions outside of said MD gene. 

6. The MD probe of one of claims 1-5 wherein 
said probe sequence is capable of hybridizing to at 
least part of the human gene encoding dystrophin. 

30 7. A method for diagnosing MD in a human 

patient comprising exposing the probe of any one of 
claims 1 to 5 to a sample taken from said patient and 
detecting whether said probe hybridizes to nucleic acid 
in said sample. 
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8. The method of claim 7 further comprising 

the steps of, 

obtaining a sample of genomic DNA from said 

patient, 

5 digesting said DNA with one or more restriction 

endonuc leases , 

denaturing said DNA, 

contacting said denatured, digested DNA with a 
probe which comprises a nucleic acid sequence at least 
10 15 bases in length from the human MD gene, 

detecting hybrid DNA complexes, and 

comparing the molecular weight of said 
complexes to control complexes formed with similarly 
denatured and digested genomic DNA taken from a person 
15 not suffering from MD, 

whereby the presence or absence of deletion 
mutations in the DMD gene is determined. 

9. The method of claim 8, wherein said probe 
comprises one or more probes which in combination span 

20 said human MD gene. 

10. The method of claim 9 wherein said probes 
are chosen from those deposited with the A.T.C.C. under 
the twelve accession numbers 57666 to 57677, inclusive. 

11. Substantially pure dystrophin polypeptide. 
25 12. The polypeptide of claim 11 wherein said 

dystrophin is human dystrophin. 

13. The polypeptide of claim 11 wherein said 
dystrophin is murine dystrophin. 

14. A substantially pure polypeptide 
30 comprising a sequence of at least six amino acid 

residues of dystrophin. 

15. A substantially pure polypeptide 
immunologically cross-reactive with dystrophin. 
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16. Substantially purified nucleic acid 
encoding dystrophin or a polypeptide immunologically 
cross-reactive with dystrophin. 

17. A polypeptide synthesized from an 

5 engineered nucleic acid sequence corresponding to an MD 
gene or a segment thereof at least 18 base pairs in 
length. 

18. DNA comprising cDNA encoding dystrophin or 
a sequence of said cDNA of at least 10 base pairs in 

10 length. 

19. Substantially purified nucleic acid 
comprising an approximately 14 kb pair sequence of MD 
gene cDNA. 

20. The nucleic acid of claim 19 wherein said 
15 MD gene is a human MD gene. 

21. The nucleic acid of claim 19 wherein said 
MD gene is a murine Dmd gene. 

22. Antibodies produced to a polypeptide of 
any one of claims 11 to 15, or 17. 

20 23. The antibodies of claim 22 wherein said 

antibodies are monoclonal antibodies. 

24. A method for immuno diagnosis of MD in a 
human comprising the steps of, 

contacting a sample from said human with an 
25 antibody of claim 22 or 23, and 

detecting the presence in said sample of 
complexes of antibody and dystrophin or fragments 
thereof . 

25. The method of claim 24 wherein said sample 
30 comprises muscle tissue. 

26. The method of claim 24 further comprising 
the steps of, 

determining the molecular weight of said 
dystrophin, or fragments thereof, in said complex, 
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whereby the lack of dystrophin or the presence 
of dystrophin of abnormal molecular weight is indicative 
of MD. 

27. The method of claim 24 further comprising 
5 the steps of determining the abundance of said 

dystrophin, or fragments thereof, in said complex. 

whereby the presence of an undetectable or 
abnormally low level of dystrophin is indicative of MD. 

28. The method of claim 26 further comprising 
10 comparing said molecular weight of dystrophin 

or fragments thereof in said complex from said sample to 
the molecular weight of a polypeptide cross-reactive 
with said antibody in at least one BMD, DMD or OMD 
control and a normal control comprising normal 

15 dystrophin, 

whereby said method provides a diagnosis of DMD 
from the presence in said sample of a polypeptide of 
molecular weight comparable to said DMD control, a 
diagnosis of BMD from the presence in said sample of a 

20 polypeptide of molecular weight comparable to said BMD 
control, and a diagnosis of the absence of MD from the 
presence in said samples of a polypeptide of molecular 
weight comparable to said normal control. 

29. The method of claim 28 further comprising 
25 the steps of 

determining the molecular weight of a plurality 
of fragments reactive with said antibody in said tissue 
sample to obtain a sample molecular weight pattern, 

comparing said sample molecular weight pattern 
30 to said BMD, DMD and said normal control patterns, 

whereby said method provides a diagnosis of DMD 
in the presence of a pattern comparable to the DMD 
control pattern, a diagnosis of BMD in the presence of a 



WO 89/06286 

» ' -X. ' ■'■ ■ 



PCT/US88/04504 



- 43 - 



pattern comparable to the BMD control pattern, and a 
diagnosis of the absence of MD in the presence of a 
pattern comparable to said normal control. 

30. The method of any one of claims 24 to 29 
5 wherein said sample is obtained from a human adult. 

31. A method of therapy for MD comprising 
administering an amount of the polypeptide of any one of 
claims 11 to 15 or 17 claims effective to control muscle 
fiber degeneration and connective tissue proliferation 

10 within muscle fibers. 

32. A method of therapy for MD comprising the 

steps of, 

obtaining cells from an MD patient, 

inserting nucleic acid encoding dystrophin or a 

15 fragment thereof into a vector, 

introducing said vector into said cells, and 
reintroducing said cells into said patient in 
an amount effective to produce dystrophin sufficient to 
control muscle fiber degeneration and connective tissue 
20 proliferation within muscle fibers within said patient. 

33. A method for purification of dystrophin or 
a fragment thereof, comprising the steps of; 

providing an antibody specific for said 

dystrophin, 

25 binding said antibody to a solid phase, 

contacting said solid phase with a solution 
comprising said dystrophin, under conditions wherein* 
said dystrophin is bound to said solid phase, and 

washing said solid phase under conditions 
30 wherein said bound dystrophin is released from said 

solid phase. 

34. A method for producing dystrophin, or a 

fragment thereof, comprising the steps of, 
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providing nucleic acid encoding said dystrophin 

or said fragment, 

introducing said nucleic acid into a cell, and 
culturing said cell under conditions suitable 
5 to induce expression of said nucleic acid to produce 

said dystrophin. 

35. A method of assaying a sample of genomic 
nucleic acid of a human subject for the presence of a 
mutated MD gene comprising the steps of, 

10 denaturing the DNA of said sample and digesting 

said DNA with one or more restriction endonuc leases, 

contacting said denatured, digested DNA with a 
probe according to any one of claims 1 to 6, and 

detecting hybrid DNA complexes as probable 
IS indications of the presence of said MD gene in said 
sample . 

36. The method of claim 35 wherein said 
nucleic acid encodes a fusion polypeptide, wherein said 
dystrophin or said fragment is produced as a polypeptide 

20 having a peptide linked to a second polypeptide. 

37. The method of claim 36 wherein said second 
polypeptide comprises a portion of an L coli 
polypeptide, and said dystrophin fragment has a 
molecular weight of between 30 kD and 60 kD. 

25 38. A method for purification of an antibody, 

^herein said antibody is specific for a polypeptide, 
said method comprising the steps of: 

providing said polypeptide in insoluble form, 
contacting said polypeptide with said antibody 
30 to form an immune complex, 

washing said complex, and 

causing said antibody to be released from said 

complex. 
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39, The method of claim 38 wherein said 
polypeptide is fused to an insoluble peptide. 

40. The method of claim 7 wherein DNA in said 
sample is fixed to a substrate and exposed to a labelled 

5 probe. 
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TTCTGTCTTAGTGAAGTCTCAAT-ATTTTTCAATGTGATTAAAATTCATATTATGTCACC 
ATAGTTATGAAAAAAGTAAAAATATGTTTTGTATAAGTCTTAT-ATTTGATAATTTAAGC 
CAATTAACAAATCCTGACAGCTGGTGTAGAATTTGAGATAAAATTCCCCTGCTGACTCAG 

GCAGTGAATAAGAATTTAAXGTTCATTAAAATATGTCATAATCTGAGACCCAAAGCAG^ 
ACCATCTGTTTCCAGTTCCTACTACATGCCTAGGTATGGAATCCATGTTCTAAAATTCTT 
TTTAATTTACAACAAGTATAATTTAT1TCGAAATAGCATCATCAATAACCAGTGTAATTC 
TCGAATATTTTTGTTGGGATTGTTATCTGAGAAATGTGGTATTTCATTTCTTTACAA 



GCAAGTCATGAAATGGCTCATGCTTTTATTGCCATTTTGATGTTTTTGATGGCAAAAGTG 

rCGAGAAA-AAGTCTT-TAGATTCACGTGATAAGCTGACAGAGTGAAACATCTTAAGGCT 

TGAAAGGGCAAGTAGAAGTTATAATTATTGTGTAGATTCACAGTCCTTGTATTGAATTAG 

TCATCTTTGCTCTCATGCTGCAGGCCATAGAGCGAGAAAAAGCTGAGAAGTTCAGAAAAC 

TGCAAGATGCCAGCAGATCAGCTCAGGCCCTGGTGGAACAGATGGTGAATGGTAATTACA 

CG-AGTTGATTTAGATAATCTTCTTAGGGATTTGATAAACACATAGGTTCATATTTATCA 

GCTGAATTATATCAGACAAGCACTTGTTAAATACAAATTTAAATTAAAAGGTGTTTGTAT 

GTTTTTTATTATTCTT.TTTTTAATGCTAAGGAAATTATTAGGAGAAATTCAACTTTGAGT 

TCATTGGAAGAAAATGGGATGTGGTAGAATATTTTATCAGTCTGTAGCAGAGAAATAAAT 

TTTAATGCAAATCTG-CT-AGAATTTATCCAAATAATTTAAGAAATAAGGT-TAACAGAA 

ATTGAAACATTAACAGTCAAGTATA 
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FIG. 5 (2 0F4) 

••ATGGCCTCGCTTTGAATGCTCTCATCCATAGTCATAGGCCAGACCTATTTGACTGGAATAGTGTGCTTT 
TC^GCACAGGGATATGAGAGAACTTOT 

SctagaSgaaaug^ 

SSIataagtcj^^ 

™caaaaacS 

GGUCnCCAclicCACGTCCCCAAAAGMG 

GATCTTGATATAACTCM 
CAAT^CGGAAGGAAGGCAACTTCTCAGAm 

TGAGAAGWCAGAAAACTGCAAGATGCCAGCAGAICAGCTCAGGCCCTGGTGGAACAG 

GCTGTTAATGCA^ 
?gCTAAGTGAgS 

aSgagcagatgacaactactgctgaam 

GC^WAAAAGTCAGnAAAAATTTGTAAGGA^ 
AAMATTAAAUCTC^ 

TCTGGCCTTTACAAATCATTITAAGCAAGICirnCTGATGTGCAGGCCAGAGAGAAAGAGCTACAGACA 




CAGAAACCAAACTCTCCATACCTCAACTIAGTGTCACCGACTATGAAATCATGGAgcAGAGActcgGGGA 

AWG^GGC^KACAAAGTTCTCTGCAAGAGCAACAAAGTGGCCTATACTATCTCAGCACCACTGTGAAA 

GAGATGTCGAAGAAAGCGCCCTCTGAAATTAGCCGGAAATATCAATCAGAATTTGAAGA 

GCTGGAAGAAGCTCTCCTCCCAGCTGGTTGAGCATTGTCAAAAGCTAGAGGAGCAAATGAATAAACTCCG 

AAAAATTCAGAATCACATACAAACCCTGAAGAAATGGATGGCTGAAGTTGATGITTTTcTGAAGGAGGAA 

TGGCCtGCCCTTGGGGATTCAGAAATTCTAAAAAAGCAGCTGaaacagtgcagacttttagtcagtgata. 

ttcagacaattcAGCCCAGTCTAAACAGTGTCAATGAAGGTGGGCAGAAGATaaagaatgaagcagagcc 

agagtttgcttcgagacttgagacagaactcaaaga^^^ 

gtctatgccagaaaggaggccttgaagggaggtttggagaaaactGTAAGCCTCCAGAAAGATCTatcag 

agatgcacgaatggatgacacaagctgaagaagagtatcfctgagagagat^ 

tffaattacaBaaaecagttgaagagatgaagagagctaaagaagaggcccAACAAAAAGAAGCGAAAGTG 

SIctccttacSgtctgtaaatagtgtcatagctc^ 

A^AAGWGAAACTCTAACCACCAACTACCAGTGGCTCTGCACTAGGCTGAATGGGAAATGCAAGACTTT 

GGA^AAGTTTCGGCATGTTGGCATGAGTTATTGTCATACTTGGAGAAAGCAAAGAAGTGGCTAAATGAA 

GTAGAATTTAAACTTAAAACCACTGAAAACATTCCTGGCGGAGCTGAGGAAATCTCTGAGGTGCTAGATT 

CAC^AAAi^GATGCGACAnCA(5AGGATAACCCAAAI(^GmCGCATATTGGCACAGACCCTAAC 

ArATCGCGG^TCATGGATGAGCTAATCAATGAGGAACTTGAGACATTTAATTCTCGTTGGAGGGAACTA 

CATGAAGAGGCTGTAAGGAGGCAAAAGTTGCTTGAACAGAGCATCCAGTCTGCCCAGGAGACTGAAAAAT 

CCTTACACTTAATCCAGGAGTCCCTCACATTCAITGACAAGGAGTTGGCAGCTTATATTGCAGACAAGGT 

CGACGCAGCTCAAATGCCTCAGGAAGCCCAGAAAATCCAATCTGATTTGACAAGTCATGAGATCAGTTTA 

GAACAAATGAAGAAACATAATCAGGGGAAGGAGGCTGCCCAAAGAGTCCTGTCTCAGATTGATGTTGCAC 

AGAAAAA^TTACAAGATGTCTCCATGAAGTrTCGATTATTCCAGAAACCAGCCAATITTGAGCTGCGTCT 

ACAAGAAA.GTAAGATGATTTTAGATGAAGTGAAGATGCACTTGCCTGCATTGGAAACAAAGAGTGTGGAA 

CAGGAAGTAGTACAGTCACAGCTAAATCATTGTGTGAACTTGTATAAAAGTCTGAGTGAAGTGAAGTCTG 

AAGTGGAAATGGIGATAAAGACTGGACGTCAGATTGTAGAGAAAAAGCAGACGGAAAATCCCAAAGAACT 

TGAJGAAAGAGTAACAGCTTTGAAATTGCATTATAATGAGCTGGGAGCAAAGGTAACAGAAAGAAAGCAA 
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FIG. 5 (3 0F4) 

CAGTTGGAGAAA.TGCTTGAAATTGTCCCGTAAGATGCGAAAGGAAATGAATGTCTTGACAGAATGGCTGG 

CAGCTAOAGATATGGAATTGACAAAGAGATCAGCAGTTGAAGGAATGCCTAGTAATTTGGATTCTGAAGT 

TGCCTGGGGAAAGGCTACTCAAAAAGAGATTGAGAAACAGAAGGTGCACCTGAAGAGTATCACAGAGGTA 

GGAGAGGCCTTGAAAACAGTTTTGGGCAAGAAGGAGACGTTGGTGGAAGATAAACTCAGTCTTCTGAATA 

GTAACTGGATAGCTGTCACCTCCCGAGOAGAAGAGTGGTTAAATCTTTTGTTGGAATACCAGAAACACAT 

GGAAACTTTTGACGAGAATGTGGACCACATCACAAAGTGGATCATTCAGGCTGACACACTTTTGGATGAA 

TCAGAGAAAAAGAAACCCCAGCAAAAAGAAGACGTGCTTAAGCGTTTAAAGGCAGAACTGAATGACATAC 

GCCOAAAGGTGGACTCTACACGTGACCAAGOAGCAAACTTGATGGCAAACCGOGGTGACCACTGCAGGAA 

ATTAGTAGAGCCCCAAATCTCAGAGCTCAACCATCGATTTGCAGCCATTTCACACAGAATTAAGACTGGA 

AAGGCCTCCATTCCTTTGAAGGAATIGGAGCAGI1TAACTCAGATATACAAAAATTGCTTGAAGGACTGG 

AGGCTGAAAnCAGCAGGGGGTGAATCTGAAAGAGGAAGACTTCAATAAAGATATGAAIGAAGACAATGA 

CGGTACTGTAAAAGAATTGTTGOAAAGAGGAGACAACTTACAACAAAGAAICACAGATGAGAGAAAGAGA 

GAGGAAATAAAGATAAAACAGCAGCTGTTACAGACAAAAGATAATGCTCTOAAGGATTTGAGGTCTCAAA 

GAAGAAAAAAGGCTCTAGAAATTTGTCATCAGTGGTATCAGTACAAGAGGCAGGCTGATGATCTCCTGAA 

ATGCTTGGATGACATTGAAAAAAAATTAGCGAGCCTACCTGAGCCCAGAGATGAAAGGAAAATAAAGGAA 

ATTGATCGGGAATTGCAGAAGAAGAAAGAGGAGCTGAATGCAGTGCGTAGGCAAGCTGAGGGGTTGTCTG 

AGGATGGGGCCGCAATGGCAGTGGAGCCAACTCAGATCOAGCTOAGOAAGCGCTGGCGGGAAATTGAGAG 

CAAATTTGCTCAGTTTCGAAGAOTOAACTTTGGAOAAATTGAOACTGTCCGTGAAGAAACGATGATGGTG 

atgactgaagagatgcctttggaaatttcttatgtgccttctacttatttgactgaaatcactcatgtct 
oacaagccctattagaagtggaacaacttctcaatgctcctgacctctgtgctaaggactttgaagatct 
ctttaagcaagaggagtctctgaagaatataaaagaragtctacaacaaagctcaggtcggattgacatt 
attcatagcaagaagacagcagcattgcaaagtgcucgcctgtggaaagggtgaagctacaggaagctc 
tctcccagcttgatttccaaigggaaaaagitaacaaaatgtaoaaggaccgaoaagggcgatttgacag 
atctgttgagaaatggcggcgttttcattatgatataaagatatttaatcagtggctaacagaagctgaa 
cagtttctcagaaagacagaaattcctgagaattgggaacatgctaaataoaaatggtatcttaaggaac 
tgoaggatggcattgggcagcggcaaactgitgtcagaacattgaatgcaactggggaagaaataattca 
gcaatcctcaaaaacagatgccagtattctacaggaaaaattgggaagcctgaatctgcggtggcaggag 
gtctgcaaacagctgtcagacagaaaaaagagggtagmgaacaaaagaatatcttgtcagaatttcaaa 

gagatttaaatgaatttgttttatggttggaggaagaagataagattgctagtatcccacttgaacctgg 
aaaagagcagcaactaaaagaaaagcttgagoaagtcaagttactggtggaagagttgcccctgcgccag 
ggaattctcaaaoaattaaatgaaactggaggacccgtgcttgtaagtgctcccataagcccagaagagc 
aagataaacitgaaaataagctcaaggagacaaatctccagtggataaaggiitccagagctttacctga 
gaaacaaggagaaattgaagctcaaataaaagacgttgggcagcttgaaaaaaagcttgaagaccttgaa 
gagcagttaaatcatctgctgctgtggttatctcctattaggaatoagttggaaatttataaccaaccaa 
accaagaaggaccatttgacgttcaggaaactgaaataggagttcaaggtaaacaaccggatgtggaaga 

GATTTTGTCTAAAGGGCAGCATTTGTACAAGGAAAAACCAGCCACTCAGCCAGTGAAGAGGAAGTTAGAA 

GATCTGAGCTOTGAGTGGAAGGCGGTAAACCGTTTACTTCAAGAGCTGAGGGCAAAGCAGCCTGACCTAG 

CTCCTGGACTGACCACTATTGGAGCCTCTCCTACTCAGACTGTTACTCTGGTGACACAACCTGTGGTTAC 

TAAGGAAACTGCCATCTCCAAACTAGAAATGCCATCTTCCTTGATGTTGGAGGTACGTGCTCTGGCAGAT 

TTCAACCGGGCTTGGACAGAACTTACCGACTGGGTTTCTCTGCTTGATCAAGTTATAAAATCACAGAGGG 

TGATGGTGGGTGACCTTGAGGATATCAACGAGATGATCATCAAGCAGAAGGCAACAATGCAGGATTTGGA 

ACAGAGGGGTCCCCAGTTGGAAGAACTCATTACCGCTGCCCAAAATTTGAAAAACAAGACCAGCAATCAA 

GAGGCTAGAACAATCATTACGGATCGAATTGAAAGAATTCAGAATCAGTGGGATGAAGTACAAGAACACG 

TTCAGAACCGGAGGCAACAGTTGAATGAAATGTTAAAGGATTCAACACAATGGCTGGAAGCTAAGGAAGA 

AGCTGAGCAGGTCTTAGGACAGGCCAGAGCCAAGCTTGAGTCATGGAAGGAGGGTCCCTATACAGTAGAT 

GGAATCCAAAAGAAAATOACAGAAACCAAGCAGTTGGCCAAAGACCTCCGCCAGTGGCAGACAAATGTAG 

ATGTGGCAAATGACTTGGCCCTGAAACTTCTCCGGGATTATTCTGGAGATGATACCAGAAAAGTCCACAT 

GATAACAGAGAATATCAATGCCTCITGGAGAAGCATTCATAAAAGGGTGAGTGAGCGAGAGGCTGGTTTG 

GAAGAAACTCATAGAnACTGCAACAGlTCCCCCTGGACCTGGAAAAGTTTCTTGCCTGGCTTACAGAAG 

CTGAAACAACTGCCAATGTCCTACAGGATGCTACCCGTAAGGAAAGGCTCCTAGAAGACTCOAAGGGAGT 

AAAAGAGCTGATGAAACAATGGCAAGACCTCCAAGGTGAAATTGAAGCTCACACAGATGTTTATCACAAC 

CTGGATGAAAACAGCCAAAAAATCCTGAGATCCCTGGAAGGTTCCGATGATGCAGTCCTGTTACAAAGAC 

GTTTGGATAACATGAACTTCAAGTGGAGTGAACTTCGGAAAAAGTCTCTCAACATTAGGTCCCATTTGGA 

AGCCAGTTCTGACCAGTGGAAGCGTCTGCACCTITCTCTGCAGGAACTTCTGGTGTGGCTACAGCTGAAA 

GATGATGAATTAAGCCGGCAGGCACCTATTGGAGGCGACTTTCCAGCAGTTCAGAAGCAGAACGATGTAC 

ATAGGGCCTTCAAGAGGGAATTGAAAACTAAAGAACCTGTAATCATGAGTACTCTTGAGACTGTACGAAT 

ATTTCTGACAGAGCAGCCTTTGGAAGGACTAGAGAAACTCTACCAGGAGCCCAGAGAGCTGCCTCCTGAG 

GAGAGAGCCCAGAATGTCAGTCGGCTTCTACGAAAGCAGGCTGAGGAGGTCAATACTGAGTGGGAAAAAT 

TGAACCTGCACTCCGCTGACTGGCAGAGAAAAATAGATGAGACCCTTGAAAGACTCCAGGAACTTCAAGA 

GGCCACGGATGAGCTGGACCTCAAGCTGCGCCAAGCTGAGGTGATGAAGGGATCCTGGCAGCCCGTGGGC 
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GATCTCCTCATTGACTCTCTCCAAGATCACCTCGAGAAACTCAAGGCACTTCGAGW^ 
TGAAAGAGMCGTGAGCCACGTCAATGACCTTGCTCGCCAGCTTACCACTn 

GTCAGGCAGCTGCAT^ 

WGCTGGGACCATCCCAAAATGACAGAGCTCTACCAGTGTTTAGOTGAOOTGAATA^ 
GCTTATAGGACTGCCATGAAACTCCGAAGACTGCAGAAGGCCCnTGCTTGGA 
CTGCATCTGATGCCTTGGACCAGCACAACCTOAAGCAAAATGACCAGCCCATGGATATCCT 
TAATTGTTTGACCACTATTTATGACCGCCTGGAGCAAGAGCACAACAATTTGGTCAACGTCCCTCTCTGO 

GTGGATATGTGTCTGAACTGGCTGCTGAATGTTTATGATACGGGACGAACAGGGAGGATCCGTGTCCTGT 
CTTTTAAAACTGGCATCATTTCCCTGTGTAAAGCACATTTGGAAGACAAGTACAGATACCTTTTCAAGCA 
AGTGGCAAGnCAACAGGATTTTGTGACCAGCGCAGGCTGGGCCTCOnCTGCATGATTCTATCCAMTT 
CCAAGACAGTTGGGTGAAGTTGCATCCTTTGGGGGCAGTAACATTGAGCCAAGTGTCCGGAGCTGCTTCC 
AATTTGCTAATAATAAGCCAGAGATCGAAGCGGCCCTCnCCTAGACTGGATGAGACTGGAACCCCAGTC 
CATGGTGTGGCTGCCCGTCCTGCACAGAGTGGCTGCTGCAGAAACTGCCAAGCATCAGGCCAAATGTAAC 
ATCTGCAAAGAGTGTCCAATCATTGGATTCAGGTACAGGAGTCTAAAGCACTTTAATTATGACATCTGCC 
AAAGCTGCAAAAAAACTGGTCGAGTTGCAAAAGGCCATAAAATGCACTATCCOATGGTGGAATATTGCAC 
TCCGACTACATCAGGAGAAGATGnCGAGACTTTGCCAAGGTACTAAAAAAOAAATnCGAACCAAAAGG 
TATTTTGCGAAGCATCCCCGAATGGGCTACCTGCCAGTGOAGACTGTCTTAGAGGGGGACAACATGGAAA 
CTCCCGTTACTCTGATCAACTTCTGGCOAGrAGATTCTGCGCCTGCCTCGTCCCCTCAGCTTTCACACGA 
TGATACTGATTCAOGCATXGAACATTATGCTAGCAGGCTAGOAGAAATGGAAAACAGCAATGGATCTTAT 
CTAAATGATAGCATGTCTCCTAATGAGAGCATAGATGATGAACATTTGTTAATCCAGCATTACTGCCAAA 
GTTTOAAOCAGGAOTCCCCCCTGAGCCAGCCTCGTAGTCGTGCCCAGATCTTGATTTCCTTAGAGAGTGA 
GGAAAGAGGGGAGCTAGAGAGAATCCTAGCAGATCTTGAGGAAGAAAACAGGAATCTGCAAGCAGAATAT 
GACCMCTAAAGCAGCAGCACGAACATAAAGGCCTGTCCCCACTGCCGTCCCCTCCTGAAATGATGCCCA 
CCTCTCCCOAGAGTCCCCGGGATGCTGAGCTCATTGCTGAGGCCAAGCTACTGCGTCAACACAAAGGCCG 
CCTGGAAGCCAGGATGOAAATCCTGGAAGACCACAATAAACAGCTGGAGTCACAGTTACACAGGCTAAGG 
CAGCTGCTGGAGCAACCCCAGGCAGAGGCCAAAGTGAATGGCACAACGGTGTCCTCTCCTTOTACCTCTC 
TACAGAGGTCCGACAGCAGTCAGCCTATGCTGCTCCGAGTGGTTGGOAGTCAAACTTCGGACTCCATGGG 
TGAGGAAGATCTTCTCAGTCCTCCCCAGGACACAAGCACAGGGTTAGAGGAGGTGATGGAGCAACTCAAC 
AACTCCTTCCCTAGTTOAAGAGGAAGAAATACCCCTGGAAAGCCAATGAGAGAGGACACAATGTAGGAAG 
TCTTTTCCACATGGCAGATGATTTGGGCAGAGCGATGGAGTCCTTAGTATCAGTCATGACAGATGAAGAA 
GGAGCAGAATAAATGTTTTACAACTCCTGATTCCCGCATGGTTTTTATAATATTCATACAACAAAGAGGA 

TTAGACAGTAAGAGTTTA^ 

TCACTAGTTTCTAAGTCTGTTATrGTTTTGTTAACAATGGCAGGTTTTACACGTCTATGCAATTGTACAA 

AAAAGWATAAGAAAACTACATGTAAAATCTTGATAGCTAAATAACTTGCCATTTCTTTATATGGA^ 

ATTMGGGTTGTTTAAAAATTTATAACAGTTATAAAGAAAGATTGTAAACTAAAGTGTGCTTTATAAAAA 

UAGTTGTTTATAAAAACCCCTAAAAACAAAACAAACACACACACACACACATACAGACACACACACAAA 

ACTTTGAGGCAGCGCATTGTTTTGCATCCTTTTGGCGTGATATCCATATGAAATTOATGGCTTTTTCTTT 

TTTTGCATATTAAAGATAAGACTTCCTCTACCACCAOACCAAATGAOTACTACACACTGCTCATTTGAGA 

ACTGTCAGCTGAGTGGGGCAGGCTTGAGTTTTCATTTCATATATCTATATGTCTATAAGTATATAAATAC 

TlTArTTATATAGATAAAGAGATACGAATTTCTATAGACTGACTTTTTCCATTTTTTAAATGTTCATGTC 

ACATCCTAATAGAAAGAAATTACTrCTAGTCAGTCATCCAGGCTTACCTGCTTGGTCTAGAATGGATTTT 

TCCCCGAGCCGGAAGCCAGGAGGAAACTACACCACACTAAAACATTGTCTAGAGCTCCAGATGTTTCTCA 

^TAAACAACmCCACTGACAACGmGTAAAGTAAAGTAmGATTTTmAMGGGAACATGTGAA 

TriA^ACAGGACTTATTATATCAGAGTGAGTAATCGGTTGGTTGGTTGATTGATTGATTGATTGATAC 

AT^AGCTOCCTGCTGCTAGCMTGCCACGATTTAGATTTAATGATGCTTCAGTGGANNTCAATCAGAAG 

CTmCTGACCTTGTGAACATCAGAAGGTATTTTTTAACTCCCCUGCAGTAGCAGGACGATGATAGGGC 

TrrAPGGCTATGGATTCCCAGCCCATCCCTGTGAAGGAGTAGGCCACTCTTTAAGTGAAGGATTGGATGA 

TOGTTCATAATACATAAAGTTCTCTGTAATTACAACTAAATTATTATGCCCKCTTCTCACAGTCAAAAGG 

AACTGGGTGGTTTGGTTTTTGNN^^^TTTTTTAGATTTTTTGTCCCATGTGGGATGAGTTTTTAAATGCCA 

CAAGACATAATTTAAAATAAATAAACTTTGGGAAAAGGTGTAAGACAGTAGCCCCATCACATTTGTGATA 

CTGACAGGTATCAACCCAGAAGCCCATGAACTGTGTTTCCATCCTTTGCATTTCTCTGCGAGTAGTTCCA 

CACAGGTTTGTMGTAAGTAAGAAAGAAGGCAAATTGATTCAAATGTTACAAAAAAACCCTTCTTGGTGG 

ATTAGACAGGTTAAATATATAAACAAACAAACAAAAATTGCTO 
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GCCCTGGGGGATGCTGAAATCCTGAAAAAACAGCTCAAACAATGCAGACTTTTAGTTGGT 
- - -ILKKQLKQCRLLVG 



A L G 0 
A L G D 
GCCCTTGGGGATTCA 



E 
E 



L K K Q L K Q 



R 



V 



GAAATTCTAAAAAAGCAGCTGAAACAGTGCAGACTTTTAGTCAGT 



GATATTCAA^CAATTCAGCCCAGTTTAAATAGTGTTAATGAAGGTGGGCAGAAGATAAAG 
nTOTlQPSLNSVNEGGQKl.iv 
DIQT U LNSVNEGGQK 



K 



gatattcagLaattcagcccagtctaaacagtgtcaatgaaggtgggcagaagataaa^ 



FIG.7B-4 
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* * 



* * * ** * * * 

AGTGAAGCTGAACTTGAGTTTGCATCCAGACTGGAGACAGAACTTAGAGAGCTTAACACT 

EFASRLETEL 
EFASRLETEL 



S 
N 



E A E 
E A E 



L 
P 



R 
K 



E L 
E L 



N 
N 



T 
T 



AATGAAGCAGAGCCAGAGTTTGCTTCGAGACTTGAGACAGAACTCAAAGAACTTAACACT 



** 



CAGTGGGATCACATATGCCGCCAGGTCTA^CAGAAAGGAAGCCTTAAAGGCAGGTTTG 

cIrIq V Y 



Q W D H 
Q W D H 



I 
M 



C 1 Q I Q V Y 



T 
A 



R K E A L K 
R K E A L K 



A 
G 



G L 
G L 



CAGTGGGATCACATGTGCCAACAGGTCTATGCCAGAAAGGAGGCCTTGAAGGGAGGTTTG 

* * * * * 

GATAAAACCGTAAGCCTCCAAAAAGATCTATCAGAGATGCATGAGTGGATGACACAAGCT 
KTVSLQKDLSEMHEWMTQA 
KTVSLQKDLSEMHEWMTQA 
GAGAAAACTGTAAGCCTCCAGAAAGATCTATCAGAGATGCACGAATGGATGACACAAGCT 

* * ** * 

GAAGAAGAATATCTAGAGAGAGATTTTGAATATAAAACTCCAGATGAATTACAGACTGCT 
EEEYLERDFEYKTPDELQ 
EEEYLERDFEYKTPDELQ 




T 
K 



A 
A 



GAAGAAGAGTATCTTGAGAGAGATTTTGAATATAAAACTCCAGATGAATTACAGAAAGCA 



GTTGAAGAAATGAAGAGAGCTAAAGAAGAGGCACTACAAAAAGAAACTAAAGTGAAACTC 
VEEMKRAKEEALQKETKVKL 
VEEMKRAKEEA 717 
GTTGAAGAGATGAAGAGAGCTAAAGAAGAGGCCC 2153 

CTTACTGAGACTGTAAATAGTGTAATAGCTCACGCTCCACCCTCAGCACAAGAGGCCTTA 

LTETVNSVIAHAPPSAQEAL 

AAAAAGGAACTTGAAACTCTGACCACCAACTACCAATGGCTGTGCACCAGGCTGAATGGA 

KKELETLTTNYQWLCTRLNG 

AAATGCAAAACTTTGGAAGAAGTTTGGGCATGTTGGCATGAGTTATTGTCATATTTAGAG 

KCKTLEEVWACWHELLSYLE 

AAAGCAAACAAGTGGCTCAATGAAGTAGAATTGAAACTTAAAACCATGGAAAATGTTCCT 

KANKWLNEVELKLKTMENVP 

GCAGGACCTGAGGAAATCACTGAAGTGCTAGAATCTCTTGAAAATCTGATGCATCATTCA 

AGPEEITEVLESLENLMHHS 

GAGGAGAACCCAAATCAGATTCGTCTATTGGCACAGACTCTTACAGATGGAGGAGTCATG 

EENPNQIRLLAQTLTDGGVM 

GATGAACTGATCAATGAGGAGCTTGAGACGTTTAATTCTCGTTGGAGGGAACTACATGAA 

DELINEELETFN SRWRELHE 

GAGGCTGTGAGGAAACAAAAGTTGCTTGAACAGAGTATCCAGTCTGCCCAGGAAATTGAA 

EAVRKQKLLEQS IQSAQEI E 

AAGTCCTTGCACTTAATTCAGG AGTCGCTTGAATTC 3273 

KSLHLIQESLEF 



Nucleic Acid Homology (2153, 1885) 87.552% 
Amino Acid Homology (717, 621) 86.611% 

FIG7B-5 
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FIG. 8 

MLWWEEVEDCYEFJIDVQKKTFTKWVNAQFSKFGK^ 

VHALNNVNKALRVLQNNNVDLVNI GSTDI VDGNHKLTLGLI WNI I LHWQVXNVMKNIMAGLQQTNSEKIL 
]^;^QSTRNYFQVNVINFTTSWSDGLALNALIE^ 

KLLDPED\n3TTYPDKKSIIMYITSLFQVl^QQVSIEMQEVEMLPRPPKVTKEEHFQLHHQMHYSQQITV 
S1AQGYERTSSPKPRFKSYAYTQAAYVTTSDPTRSPFPSQHLEAPEDKSFGSSLMESEVHLDRYQTALEE 
VLSWLLSAEDTLQAQGEISNDVEVVKDQFHTHEGYMMPLTAHQGRVGNILQKjSKLIGTGKI^ 

0EQMNLI2ISRWECraVASMEKQSinjmVIMDLQ^ 

QQHKVLQE DLEQEQVRVNSLTHMVVVVDESSGDHATAALEEQLKVIXSDRWANICRWTEDRWVLI^DIIXK 
WORLTEEQCLFSAWLSEKEDAVNKIHTTGFKDQNEMLSSLQKLA VLKADLEKKKQSMGKLYSLKQDLLST 
LKNKSVTQKTEAWLDNFARCWDNLVQKLEKSTAQ ISQAVTTTQPSLTQTTVMETVTTVTTREQILVKHAQ 
EELPPPPPQKKRQITVDSEIRKRIJ3VDITEIJISWITRSEAV1QSPEFAIFRKEGNFSDLKEKVN AIEREK 

AEKFRKLQDASRSAQALVEQMVNEGVNADSIKQASEQ1MSRW1EFCQLLSER12IWI£YQNNIIAFYNQLQ 

QLEQMTTTAENWLKIQPTTPSEPTAIKSQ1KICK DEVNRLSGLQPQIERLKIQSIALKEKGQGPMFLDAD 
FVAFTNHFKQVFSDVQAREKEI^TIFDTLPPMRYQETMSAIRTWQQSETKLSIPQI^VTDYEIMEQRI^ 
ELQ AIiQSSLQEQQSGLYYLSTTVKEMSKKAPSEISRKYQSEFEEIEGRWKKI5SQLVEHCQKLEEQMNKI« 
RKIQNHIQTLKKWMAEVDVFLKEEWPALGDSEILKTOIjKQCR liVSDIQTIQPSLNSVNEGGQKIKNEAE 
PEFASRLETELKELNTQWDHMCQQVYARKEALRGGLEKTV^ 

DELQKAVEEMK RAKEEAQQKEAKVKLLTESVNSVIAQAPPVAQEALKKEI^TLTrNYQWLCTRLNGKCKT 
LEEVWACWHELLSYLEKANKWI21EVEFKIJOTTEOTPGGAEEISEVLDSLE NLMRHSEDNPNQIRILAQTI* 
TDGGVMDELINF^XETFNSRWREIIffiEAVRRQKLLEQ 

VDAAQMPQEAQKIQSDLTSHEISIJBEMKKHNQGKEAAQRVliSQIDVAQKKI^DVSMKFRLFQKPANFEIJl 
LQESKMILDEVKMHLPALETKSVEQEWQSQLNHCV NLYKSLSEVKSEVEMVIKTGRQIVQKKQTENPKE 
LDERVTALKIflYNELGAKVTERKQQLEKCLRTiSRKIQl^^ 

VAWGK ATQKEIEKQKVHLKSITEVGFALKTVLGKKETLVEDKLSLLNSNWIAVTSR 
METFDQNVDHITKWI IQADTLLDESEKKKPQQKEDVLKRLK AEXNDIRPKVDSTRDQAANLMANRGDHCR 
KLVEPQISELNHRFAAISHP*IOTGKASIPLKELEQFNSDIQKIXEPLEAEIQ 
EGTVKEIJX2RGDNLQQRITDERKREEIKIKQQIJ^TKHNALKDI^ 

KCLDDIEKKLASLPEPP^ERKIICEIDREIiQKSa^EINAVRRQAEGIjSEDGAAMAVEPTQIQLSKRWREIE 
SKFAQFRRIJIFAQIHTVREETMMVMTEDMPIiEISYVPSTYLTEITHVSQA^ 

LFKQEESLK NIKDSI^QSSGRIDIIHSKKTAALQSATPVERVKLQEALSQLDFQWEKVKKMYKDRQGRFD 
RSVEKWPJ^HYDIKIFNQWLTEAEQFIEKTQIPENWEHAKYKWYIjKEIiQ DGIGQRQTWRTLNATGEEI I 
QQSSKTDASILQEKLGSIKl^WQEVCKQLSDRKKRLEEQKNILSEFQRDLNEFVLWIiEEADNIASIPIiEP 
GKEQQLKEKLEQVK IiVEELPI^QGILKQLNETGGFTLVSAPISPEEQDKI^NKIiKQTNI^WIKVSRALP 
EKQGEIEAQIKDLGQI£KKLEDLEEQIJIHLLLWIjSPIRNQLEIYNQPNQEGPFDVQETEIA VQAKQPDVE 
EILSKGQHLYKEXPATQPVKRKLE DLSSEWKAVNRLWELRAKQPDLAPGLTTIGASPTQTVTLVTQPVV 
TKETAJSKLFJ^PSSIMLEVPALADFNRAWTELTDWLSIiDQVIKSQRVlWGDIXDINFJlIIKQK ATMQDL 
' EQRRPQLEELITAAQNIi^NKTSNQEARTIITDRIERIQNQWDEVQEHI^NRRQQLNEMLKDSTQWLEAKE 
EAEQVLGQARAKLESWKEGPYTVD AIQKKITETK QLAKDLRQWQTNVDVANDLALKLLRDYSADDTRKVl 
MITENINASWRS IHKRVSEREAAI£ETHRLI^QFPLDLEKFLAWLTEAETTANVLQDATRKERLLEDS KG 
VK EIMKQWQDLQGEIEAHTDVXHHLBE^^ 

EAS S DQWKRLHLSLQELLVWLQLKDDELSRQAPIGGDFPAVQKQNDVH RAFKRELKTKEPVIMSTLETV I 

IFLTEQPLEGLEKLYQEPRELPPEERAQNVTRLIJ^QAEEVNTEWEKliNLHSADWQRKIDETLERI/2E 

EATDELDLKLRQAEVIKGSWQPVGDLLIDSLQDHLEKVK ALRGEIAPLKENVSHVNDLARQLTTLGIQLS 

PYNISTLEDmTRWKIJ^aVAVEDRVRQLHEAHRDFGPASQHFLSTSVQGPWERAISPNKVPYYINHETQT 

TCWDHPKMT ELYQSIAJSLNNVRFSAYRTAMKLRRLQK^LCLDLLSLSAACDALDQHHLKQNDQPMDILQI 

INCLTTIYDRLEQEH1INLV1TVPLCVDMCLMWIXNVYDTGRTGRIRVLSFKTGIISLCKAHLEDKYRYLFK 

QVASSTGFCDQRRLGLL1IIDSIQIPRQLGEVASFGGSNIEPSVRSCFQFANNKPEIEAALFLDWMRLEPQ 

SMVWLPVIimVAAAETAKHQAKCNICKECPIIGFRYRSI^FNYDICQSCKKTGRVAKGHl^YPMVEYC 

TPTTS GEDVRDFAKVl^KFRTKRYFAKHl>RMGYrJ»VQTVIXGDl^TPVTLINFWP7DS APASS PQLSH 

DDTHSRIEHYASRLAEMENSNGSYLKDSISPNESIDDEHLL1QHYCQSLNQDSPLSQPRSPAQILISLES 

EERGELERILADI£EENRNLQAEYDRLKQQHEHKGLSPLPSPPEMMPTSPQSPRDAELIAEAKLLRQHKG 

RLEARMQII£DHNKQI£SQLHRLRQIXEQPQAEAKVNGTTVSSPSTSLQRSDSSQPMLLRWGSQTSDSM 

GEEDLLSPPQDTSTGLEEVMEQLNNSFPSSRGRNTPGKPMREDTM1 
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